Voxtral TTS

Name: Voxtral TTS
Price: 16000 USD
Author: Mistral AI

Mistral AI🇫🇷 France

active

Compare with other models →

Input / 1M tokens$16000

Version History

1.0majorJune 18, 2026

Initial release of Voxtral TTS, Mistral's first text-to-speech model with 4B parameters supporting 9 languages and voice cloning from minimal audio samples.

Coverage

model releaseMistral AI

Mistral Releases Voxtral TTS: 4B Parameter Text-to-Speech Model at $0.016 per 1k Characters

Mistral AI has released Voxtral TTS, a 4B parameter text-to-speech model supporting 9 languages including English, French, German, Spanish, Dutch, Portuguese, Italian, Hindi, and Arabic. The model achieves 70ms latency for typical inputs and can clone voices from as little as 3 seconds of audio, priced at $0.016 per 1,000 characters.

June 18, 2026 · 9:07 AM2 min read

mistral-ai text-to-speech voice-cloning

model release

Mistral releases Voxtral TTS, open-source speech model for enterprise voice agents

Mistral AI released Voxtral TTS, an open-source text-to-speech model designed for enterprise voice agents and edge devices. The model supports nine languages, adapts custom voices from samples under five seconds, and achieves 90ms time-to-first-audio latency with a 6x real-time factor.

March 26, 2026 · 11:35 AM2 min read

mistral-ai text-to-speech open-source