Audio Models

Enhance your soundscapes with our innovative audio models that produce crystal-clear, immersive audio experiences. Perfect for music production, podcasts, and digital media enhancement.

Text To Audio

Dia (Text to Speech)

Dia by Nari Labs is an advanced open-weights TTS model that brings scripts to life with natural speech, emotions, and nonverbal cues. Easily control tone, voice, and delivery. Great alternative to ElevenLabs.

47.0s
a day ago
Text To Audio

Minimax Music-01

Generate up to 60 seconds of music with both accompaniment and vocals in a single pass, with vocals from lyrics and a reference track.

42.3s
a month ago
Text To Audio

3B Orpheus TTS (0.1)

Orpheus TTS is an open-source text-to-speech (TTS) system powered by the Llama 3B language model, designed for high-quality and customizable speech synthesis.

47.2s
a month ago
Text To Audio

Meta MusicGen Medium

MusicGen: Transform text into music with AI. Create unique, high-quality audio from simple descriptions. Experience the future of music generation with this innovative AI model.

21.2s
7 months ago
Text To Audio

MyShell Text To Speech

MyShell's Voice Cloning and Text to Speech - Transform your audio content with realistic, personalized voices. Experience high-quality, efficient, and cost-effective audio synthesis.

6.9s
7 months ago
Text To Audio

Openvoice

OpenVoice is a versatile voice cloning model that supports multiple languages and offers precise tone replication, flexible style control, and zero-shot cross-lingual capabilities

8.0s
7 months ago
Text To Audio

ElevenLabs Dubbing

ElevenLabs Dubbing uses AI to translate your audio into multiple languages. Easily create multilingual versions of your content without studios or voice actors for each language

91.7s
9 months ago
Text To Audio

Elevenlabs Sound Generation

Eleven Labs' Sound Generation API provides a robust development tool for programmatically generating audio content using artificial intelligence. This API empowers developers and creators to integrate sound generation functionalities into their applications and workflows.

7.8s
10 months ago
Audio To Audio

Elevenlabs Speech To Speech

Eleven Labs Speech-to-Speech offers AI-powered voice conversion for content creators, media professionals, and anyone seeking to modify or translate audio speech.

6.3s
10 months ago
Text To Audio

Elevenlabs Text To Speech

Eleven Labs Text-to-Speech (TTS) harnesses the power of deep learning to create realistic and engaging synthetic speech from written text.

11.3s
10 months ago

Cookie settings

We use cookies to enhance your browsing experience, analyze site traffic, and personalize content. By clicking "Accept all", you consent to our use of cookies.