model release

Google DeepMind Releases Gemini 3.5 Live Translate for Real-Time Speech Translation Across 70+ Languages

TL;DR

Google DeepMind released Gemini 3.5 Live Translate, an audio model that provides near real-time speech-to-speech translation across 70+ languages. The model automatically detects languages, preserves speaker intonation and pacing, and maintains a few seconds of latency while generating continuous speech output.

June 9, 2026 · 3:35 PM2 min read

Gemini 3.5 Live Translate — Quick Specs

Compare Gemini 3.5 Live Translate with other models →

Google DeepMind Releases Gemini 3.5 Live Translate for Real-Time Speech Translation Across 70+ Languages

Google DeepMind released Gemini 3.5 Live Translate on June 9, 2026, an audio model that provides near real-time speech-to-speech translation across 70+ languages with automatic language detection.

Technical Capabilities

The model generates continuous translated speech while maintaining a latency of "just a few seconds" behind the speaker, according to Google. Unlike turn-based translation systems that wait for complete sentences, Gemini 3.5 Live Translate processes streaming audio and balances translation speed with contextual accuracy.

Key technical features include:

Automatic detection of 70+ languages without manual configuration
Preservation of speaker intonation, pacing, and pitch in translated output
Noise robustness for unpredictable environments
Support for over 2,000 language pair combinations in single sessions
SynthID watermarking embedded in all generated audio

Availability and Deployment

Gemini 3.5 Live Translate is rolling out across three channels:

Gemini Live API: Available in public preview for developers via Google AI Studio. Developer platforms including Agora, Fishjam, LiveKit, Pipecat, and Vision Agents have integrated the API for real-time media streaming infrastructure.

Google Meet: Launching in private preview this month for select Google Workspace business customers, expanding from the previous limitation of five languages and English-only translation pairs. Broader rollout planned for later in 2026.

Google Translate app: Rolling out globally on Android and iOS. The model powers the Live translate feature for users with connected headphones. Android users receive an additional "listening mode" that streams translations through the phone's earpiece without headphones.

Early Implementations

Grab, which processes over 10 million voice calls monthly, is testing the model to enable multilingual communication between drivers and travelers. Additional partners including CJ ENM and LiveKit have provided feedback on translation quality and low latency, according to Google.

Pricing for API access has not been disclosed.

What This Means

Gemini 3.5 Live Translate represents Google's entry into the competitive real-time speech translation market, directly challenging established players in multilingual communication tools. The 70+ language support and 2,000+ language pair combinations significantly exceed the capabilities of Google's previous Meet translation system, which supported only five languages with English as a required pivot.

The model's continuous streaming approach addresses a core limitation of turn-based systems, though the "few seconds" latency specification lacks precision for developers evaluating real-time requirements. The integration across Google's product ecosystem—from developer APIs to consumer apps—indicates a platform play rather than a standalone model release. However, the lack of disclosed API pricing and benchmark comparisons to competing speech translation models limits technical evaluation.

Source: deepmind.google ↗

Gemini Google DeepMind speech translation audio models real-time translation multilingual AI Google Meet Google Translate

model releaseJuly 24, 2026

Google Releases Gemini Omni Flash Preview, a Multimodal Model for 720p Video Generation

Google has released Gemini Omni Flash Preview, a native multimodal model that generates short 720p videos with native audio from text, image, and video inputs. The model is available now via OpenRouter with a 131K token context window.

model releaseJuly 24, 2026

Google Launches Gemini Nano 4 and Gemini Intelligence on Samsung's Galaxy Z Fold 8, Flip 8

Samsung's Galaxy Z Fold 8, Fold 8 Ultra, and Flip 8 are the first devices to ship with Google's Gemini Nano 4 on-device model and the new Gemini Intelligence feature tier. The launch comes with strict hardware requirements including 12GB+ RAM and qualified system-on-chips.

model releaseJuly 24, 2026

Laguna S 2.1 Launches: Startup Claims Cheaper-Than-DeepSeek Pricing and Better-Than-V4-Pro Performance

A new Western AI lab has released Laguna S 2.1, a model that Reddit users and early testers describe as cheaper than DeepSeek V4 Flash while outperforming DeepSeek V4 Pro. Pricing, benchmark scores, and context window details remain undisclosed as of publication.

model releaseJuly 24, 2026

Black Forest Labs Unveils FLUX.2 [klein]: A Distilled Model for Interactive Image Generation

Black Forest Labs has released FLUX.2 [klein], a lightweight variant of its FLUX.2 image generation model family designed for faster, more interactive use. The company frames the release as a step toward 'interactive visual intelligence,' though detailed benchmarks and pricing have not yet been disclosed.

Google DeepMind Releases Gemini 3.5 Live Translate for Real-Time Speech Translation Across 70+ Languages

Gemini 3.5 Live Translate — Quick Specs

Google DeepMind Releases Gemini 3.5 Live Translate for Real-Time Speech Translation Across 70+ Languages

Technical Capabilities

Availability and Deployment

Early Implementations

What This Means

Related Articles

Google Releases Gemini Omni Flash Preview, a Multimodal Model for 720p Video Generation

Google Launches Gemini Nano 4 and Gemini Intelligence on Samsung's Galaxy Z Fold 8, Flip 8

Laguna S 2.1 Launches: Startup Claims Cheaper-Than-DeepSeek Pricing and Better-Than-V4-Pro Performance

Black Forest Labs Unveils FLUX.2 [klein]: A Distilled Model for Interactive Image Generation

Comments