Chatterbox Multilingual is our production-grade, open-source TTS model with expressive emotion control, PerTh watermarking, and zero-shot voice cloning — free to download, free to build on.
Users expect apps and agents to sound human, speak in their native language, and deliver content with authentic tone and emotion. Chatterbox Multilingual was built to meet that demand.
23 supported languages from launch — Arabic, Chinese, English, French, German, Hindi, Japanese, Korean, Spanish and more — with zero-shot voice cloning across every one.
Fine-tune delivery with emotion and intensity settings. Dial in warmth, urgency, or calm to match the moment — not just the text.
Ultra-stable inference with PerTh watermarking baked into every output, so every voice you generate is traceable and authenticated at creation.
Whether you’re designing a voice agent for customer support, a language-learning app, or a global gaming experience, Chatterbox Multilingual ships with native support for the languages your users actually speak.
A handful of multilingual samples generated directly from Chatterbox Multilingual — same model, same voice prompt, four different languages.
Generate your own samples in the Resemble app or run the model locally from Hugging Face.
Pull the weights straight from Hugging Face and generate production-quality speech in any supported language. No license key, no rate limits, no platform lock-in.
Chatterbox Multilingual ships as a standard Python package with first-class PyTorch and torchaudio support. Run it on your own GPU, deploy it on your own infra, or layer it into an existing pipeline — the only constraint is your imagination.
Built-in PerTh watermarking means every clip you generate is invisibly tagged at creation, so downstream detection stays possible even after re-encoding.
import torchaudio as ta from chatterbox.tts import ChatterboxTTS model = ChatterboxTTS.from_pretrained( repo_id="ResembleAI/chatterbox-multilingual", device="cuda" # or "cpu" ) text = "Bienvenue dans Chatterbox Multilingual." wav = model.generate(text, lang="fr") ta.save("sample_fr.wav", wav, model.sr)
The open-source release brings world-class TTS to everyone. For regulated industries and production workloads, Chatterbox Multilingual Pro closes the last mile with fine-tuning, SLAs, and low-latency infrastructure.
Everything you need to know before building with Chatterbox Multilingual.
device="cuda" or device="cpu" in from_pretrained.