Voxtral TTS

Mistral AI 🇫🇷 France
active
Input / 1M characters: $16.00

Version History

1.0 (major)

Initial release of Voxtral TTS, Mistral's first text-to-speech model, with 4B parameters, support for 9 languages, and voice cloning from minimal audio samples.

Coverage

Model release | Mistral AI

Mistral Releases Voxtral TTS: 4B Parameter Text-to-Speech Model at $0.016 per 1k Characters

Mistral AI has released Voxtral TTS, a 4B-parameter text-to-speech model supporting 9 languages: English, French, German, Spanish, Dutch, Portuguese, Italian, Hindi, and Arabic. The model achieves 70 ms latency for typical inputs and can clone a voice from as little as 3 seconds of audio. It is priced at $0.016 per 1,000 characters.
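To make the per-character pricing concrete, here is a minimal sketch of the arithmetic. The only figure taken from the coverage is the $0.016 per 1,000 characters rate. The function names are hypothetical, and the sketch assumes every character (whitespace and punctuation included) is billed with no minimum charge or rounding, which the coverage does not confirm.

```python
from decimal import Decimal

# Rate quoted in the coverage: $0.016 per 1,000 characters.
PRICE_PER_1K_CHARS = Decimal("0.016")


def estimate_cost_usd(text: str) -> Decimal:
    """Estimate the cost of synthesizing `text`.

    Assumption: every character, including whitespace, is billed,
    with no minimum charge and no rounding.
    """
    return PRICE_PER_1K_CHARS * len(text) / 1000


def cost_per_million_chars() -> Decimal:
    """Scale the per-1,000-character rate up to 1,000,000 characters."""
    return PRICE_PER_1K_CHARS * 1000


if __name__ == "__main__":
    script = "a" * 2500  # stand-in for a 2,500-character script
    print(f"2,500 characters: ${estimate_cost_usd(script)}")
    print(f"1M characters:    ${cost_per_million_chars()}")
```

Under these assumptions a 2,500-character script comes to $0.04, and one million characters to $16.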
