Explore All AI Models

From image generation to voice synthesis to video transcription — deAPI gives you a single, unified API to access the best open-source models.

Text-to-Speech

Turn any text into lifelike, natural-sounding audio. Multiple voices and languages for accessibility, e-learning, and content creation.

Qwen3 TTS CustomVoice

9 premium voices (male & female). 10 languages: EN, ZH, JA, KO, DE, FR, RU, PT, ES, IT. Instruction-driven emotion & prosody control. Streaming with 97 ms latency. Output: MP3 at 24 kHz.

Sample Prompt

The year is twenty forty-five. Humanity has finally cracked the code on fusion energy, and the world will never be the same. Cities float above the clouds, powered by reactors no bigger than a suitcase. But not everyone is celebrating. Deep beneath the ocean floor, something ancient has awakened, something that doesn't appreciate all the noise we've been making. This is the story of first contact, and it doesn't start with a handshake.

Qwen3 TTS VoiceDesign

Design a unique voice from a text description — define tone, accent, age, and emotion without any audio samples.

Sample Text

Welcome to the future of AI. What once took hours now takes seconds. Generate images, music, video, and speech, all from a single API. This is deAPI.

Voice Description

A deep, confident male voice with a smooth cinematic narrator tone. Calm yet powerful delivery, like a premium tech product reveal. Slight warmth, authoritative, inspiring trust.

Qwen3 TTS VoiceClone

Clone a voice from a short audio sample and reproduce its tone, accent, and character in any new text.

Reference Text

Welcome to the future of AI. What once took hours now takes seconds. Generate images, music, video, and speech, all from a single API. This is deAPI.

Target Text

The possibilities are endless. From a single line of text, create anything you can imagine. Voices that feel real. Music that moves you. Images that inspire. Welcome to the new era of creation.

Voice File

Reference audio provided

Chatterbox

23 languages: EN, ES, FR, DE, ZH, JA, KO, AR, HI, PL and more. Emotion exaggeration control. Built-in AI watermarking. Output: MP3/WAV/FLAC.

Sample Prompt

Imagine a voice so natural, so expressive, that you forget it's AI. That's exactly what Chatterbox delivers. Built for developers, creators, and builders who refuse to compromise on quality. From silky-smooth narration to high-energy hype, Chatterbox handles every emotion, every accent, every style. And the best part? It's completely open source. No black boxes. No limits. Just pure, next-level voice generation. Welcome to the new standard in speech AI. Welcome to Chatterbox.

Kokoro

41 voices (male & female). 7 languages: EN-US, EN-GB, ES, FR-FR, HI, IT, PT-BR. Output: MP3/FLAC at 24 kHz.

Sample Prompt

Artificial intelligence is revolutionizing various industries and daily life. From healthcare diagnostics to autonomous vehicles, AI systems are transforming how we work, communicate, and solve complex problems.
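
Choosing between these models mostly comes down to language coverage and output format. The sketch below is one way to encode the specs listed on this page as a small local lookup table. It is illustrative only and does not call deAPI: the `TTSModel` class and the `find_models` helper are hypothetical names, not part of any deAPI SDK. Chatterbox is listed with 23 languages, but only the 10 named on this page are included, and the VoiceDesign and VoiceClone models are left out because no languages or formats are listed for them.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TTSModel:
    """Hypothetical local record of the specs listed on this page."""
    name: str
    languages: frozenset  # language codes exactly as listed on the page
    formats: frozenset    # output formats exactly as listed on the page


CATALOG = (
    TTSModel(
        "Qwen3 TTS CustomVoice",
        frozenset({"EN", "ZH", "JA", "KO", "DE", "FR", "RU", "PT", "ES", "IT"}),
        frozenset({"MP3"}),
    ),
    TTSModel(
        "Chatterbox",
        # Only the 10 codes named on the page; the other 13 are not listed.
        frozenset({"EN", "ES", "FR", "DE", "ZH", "JA", "KO", "AR", "HI", "PL"}),
        frozenset({"MP3", "WAV", "FLAC"}),
    ),
    TTSModel(
        "Kokoro",
        frozenset({"EN-US", "EN-GB", "ES", "FR-FR", "HI", "IT", "PT-BR"}),
        frozenset({"MP3", "FLAC"}),
    ),
)


def base_language(code: str) -> str:
    """Reduce a code such as 'EN-GB' or 'en-gb' to its base language, 'EN'."""
    return code.split("-")[0].upper()


def find_models(language: str, fmt: str) -> list:
    """Return names of catalog models that list the language and the format."""
    wanted = base_language(language)
    return [
        model.name
        for model in CATALOG
        if fmt.upper() in model.formats
        and any(base_language(code) == wanted for code in model.languages)
    ]


if __name__ == "__main__":
    print(find_models("HI", "FLAC"))
    print(find_models("RU", "MP3"))
```

Matching on the base language is a simplifying assumption: it treats a request for `EN-GB` as satisfied by a model that lists only `EN`, which may not hold for accent-specific voices.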