speech translation

2 articles tagged with speech translation

June 9, 2026
model release

Google DeepMind Releases Gemini 3.5 Live Translate for Real-Time Speech Translation Across 70+ Languages

Google DeepMind released Gemini 3.5 Live Translate, an audio model that provides near real-time speech-to-speech translation across 70+ languages. The model automatically detects languages, preserves speaker intonation and pacing, and maintains a few seconds of latency while generating continuous speech output.

May 7, 2026
model releaseOpenAI

OpenAI releases GPT-Realtime-2 reasoning voice model with two specialized variants for translation and transcription

OpenAI has released three new realtime voice models through its Realtime API: GPT-Realtime-2 with GPT-5-class reasoning capabilities, GPT-Realtime-Translate supporting 70 input languages, and GPT-Realtime-Whisper for streaming transcription. The models are priced at $32-64 per 1M audio tokens for GPT-Realtime-2, and $0.017-0.034 per minute for the specialized variants.