speech-to-speech

4 articles tagged with speech-to-speech

June 9, 2026
product update

Google launches Gemini 3.5 Live Translate with continuous speech-to-speech in 70+ languages

Google announced Gemini 3.5 Live Translate, a speech-to-speech translation model supporting over 70 languages with continuous audio generation. The model rolls out today to Google Translate on Android and iOS, with Google Meet integration coming in private preview this month for select Workspace customers.

June 8, 2026
product update
Amazon Web Services

AWS releases open-source test harness for evaluating Amazon Nova Sonic voice agents at scale

Amazon has released an open-source testing framework for Nova Sonic voice agents that automates multi-turn conversation evaluation without requiring human testers. The harness uses LLM-as-judge techniques to assess voice agents across six metrics including goal achievement, response accuracy, and tool usage, addressing a critical QA bottleneck in voice AI development.

May 13, 2026
product update
Amazon Web Services

AWS Launches WebRTC Integration for Amazon Nova Sonic Real-Time Voice Streaming

AWS has added WebRTC support to Amazon Nova Sonic, its speech-to-speech model, through Amazon Kinesis Video Streams. The integration delivers real-time voice streaming with sub-second latency and includes adaptive bitrate control, forward error correction, and voice activity detection for mobile and IoT applications.

April 28, 2026
product update
Amazon Web Services

Amazon Nova 2 Sonic Unifies Speech Recognition, Reasoning, and TTS in Single Streaming Model

Amazon Web Services released technical guidance for migrating text agents to voice assistants using Amazon Nova 2 Sonic, a native speech-to-speech model that combines automatic speech recognition, reasoning, tool calling, and text-to-speech in a single bidirectional streaming interface. The model supports asynchronous tool calling and built-in voice activity detection for handling interruptions.