Qwen3 TTS CustomVoice
9 premium voices (male & female). 10 languages: EN, ZH, JA, KO, DE, FR, RU, PT, ES, IT. Instruction-driven emotion & prosody control. Streaming with 97ms latency. Output: MP3 at 24kHz.
High-quality TTS in dozens of voices and languages.
Uploading…
Estimated cost: calculating… $0.00248/clip
2 free clips left
{
"mode": "voice_clone",
"model": "Qwen3_TTS_12Hz_1_7B_Base",
"text": "The possibilities are endless. From a single line of text, create anything you can imagine. Voices that feel real. Music that moves you. Images that inspire. Welcome to the new era of creation.",
"lang": "English",
"speed": 1,
"format": "mp3",
"sample_rate": 24000,
"ref_audio": "(binary file)",
"ref_text": "Welcome to the future of AI. What once took hours now takes seconds. Generate images, music, video, and speech, all from a single API. This is deAPI."
}
curl -X POST \
'https://api.deapi.ai/api/v1/client/txt2audio' \
-H 'Accept: application/json' \
-H 'Authorization: Bearer {{ YOUR_API_TOKEN }}' \
-F 'mode=voice_clone' \
-F 'model=Qwen3_TTS_12Hz_1_7B_Base' \
-F 'text=The possibilities are endless. From a single line of text, create anything you can imagine. Voices that feel real. Music that moves you. Images that inspire. Welcome to the new era of creation.' \
-F 'lang=English' \
-F 'speed=1' \
-F 'format=mp3' \
-F 'sample_rate=24000' \
-F 'ref_audio=@/path/to/reference.wav' \
-F 'ref_text=Welcome to the future of AI. What once took hours now takes seconds. Generate images, music, video, and speech, all from a single API. This is deAPI.'
Built for developers who need power without complexity.
9 premium voices (male & female). 10 languages: EN, ZH, JA, KO, DE, FR, RU, PT, ES, IT. Instruction-driven emotion & prosody control. Streaming with 97ms latency. Output: MP3 at 24kHz.
Design a unique voice from a text description — define tone, accent, age, and emotion without any audio samples.
23 languages: EN, ES, FR, DE, ZH, JA, KO, AR, HI, PL and more. Emotion exaggeration control. Built-in AI watermarking. Output: MP3/WAV/FLAC.
41 voices (male & female). 7 languages: EN-US, EN-GB, ES, FR-FR, HI, IT, PT-BR. Output: MP3/FLAC at 24kHz.
Join developers & vibe coders building AI apps with deAPI. No credit card, zero setup headaches — just pure, instant AI magic.
Need enterprise pricing or SLA? Talk to Sales
Confirm Generation
This amount will be deducted from your account balance.
Enhance Prompt
Confirm to apply the enhanced prompt and deduct the listed credits from your balance.
You've reached your free daily limit
Your counter resets at 00:00 UTC.
Want to keep going? Register now and get $5 in free credits to start.