Gemini 3.1 Flash TTS

Name: Gemini 3.1 Flash TTS
Author: Google DeepMind

Google DeepMind🇺🇸 United States

active

Compare with other models →

Version History

3.1minorApril 15, 2026

Gemini 3.1 Flash TTS introduces audio tags for granular control over vocal style, pace, and delivery through natural language commands. The model achieves an Elo score of 1,211 and includes mandatory SynthID watermarking.

Coverage

model releaseGoogle DeepMind

Google DeepMind releases Gemini 3.1 Flash TTS with audio tags for precise speech control across 70+ languages

Google DeepMind launched Gemini 3.1 Flash TTS, a text-to-speech model that achieved an Elo score of 1,211 on the Artificial Analysis TTS leaderboard. The model introduces audio tags that allow developers to control vocal style, pace, and delivery through natural language commands embedded in text input, with support for 70+ languages.

April 15, 2026 · 4:21 PM2 min read

text-to-speech google-deepmind gemini