Explore All AI Models
From image generation to voice synthesis to video transcription — deAPI gives you a single, unified API to access the best open-source models.
Text-to-Speech
Turn any text into lifelike, natural-sounding audio. Multiple voices and languages for accessibility, e-learning, and content creation.
Qwen3 TTS CustomVoice
Text-to-Speech9 premium voices (male & female). 10 languages: EN, ZH, JA, KO, DE, FR, RU, PT, ES, IT. Instruction-driven emotion & prosody control. Streaming with 97ms latency. Output: MP3 at 24kHz.
Sample Output
Sample Prompt
The year is twenty forty-five. Humanity has finally cracked the code on fusion energy, and the world will never be the s...
The year is twenty forty-five. Humanity has finally cracked the code on fusion energy, and the world will never be the same. Cities float above the clouds, powered by reactors no bigger than a suitcase. But not everyone is celebrating. Deep beneath the ocean floor, something ancient has awakened — something that doesn't appreciate all the noise we've been making. This is the story of first contact, and it doesn't start with a handshake.
Qwen3 TTS VoiceDesign
Text-to-SpeechDesign a unique voice from a text description — define tone, accent, age, and emotion without any audio samples.
Sample Output
Sample Text
Welcome to the future of AI. What once took hours now takes seconds. Generate images, music, video, and speech, all from a single API. This is deAPI.
Voice Description
A deep, confident male voice with a smooth cinematic narrator tone. Calm yet powerful delivery, like a premium tech product reveal. Slight warmth, authoritative, inspiring trust.
Qwen3 TTS VoiceClone
Text-to-SpeechClone voice from a short audio sample — reproduce its tone, accent, and character in any new text.
Sample Output
Reference Text
Welcome to the future of AI. What once took hours now takes seconds. Generate images, music, video, and speech, all from a single API. This is deAPI.
Target Text
The possibilities are endless. From a single line of text, create anything you can imagine. Voices that feel real. Music that moves you. Images that inspire. Welcome to the new era of creation.
Voice File
Chatterbox
Text-to-Speech23 languages: EN, ES, FR, DE, ZH, JA, KO, AR, HI, PL and more. Emotion exaggeration control. Built-in AI watermarking. Output: MP3/WAV/FLAC.
Sample Output
Sample Prompt
Imagine a voice so natural, so expressive, that you forget it's AI. That's exactly what Chatterbox delivers. Built for d...
Imagine a voice so natural, so expressive, that you forget it's AI. That's exactly what Chatterbox delivers. Built for developers, creators, and builders who refuse to compromise on quality. From silky-smooth narration to high-energy hype — Chatterbox handles every emotion, every accent, every style. And the best part? It's completely open source. No black boxes. No limits. Just pure, next-level voice generation. Welcome to the new standard in speech AI. Welcome to Chatterbox.
Kokoro
Text-to-Speech41 voices (male & female). 7 languages: EN-US, EN-GB, ES, FR-FR, HI, IT, PT-BR. Output: MP3/FLAC at 24kHz.
Sample Output
Sample Prompt
Artificial intelligence is revolutionizing various industries and daily life. From healthcare diagnostics to autonomous...
Artificial intelligence is revolutionizing various industries and daily life. From healthcare diagnostics to autonomous vehicles, AI systems are transforming how we work, communicate, and solve complex problems.