speech-to-speech

5 articles tagged with speech-to-speech

June 24, 2026

Loka Achieves 87% Speech Reasoning Accuracy Using Amazon Nova 2 Sonic, Outperforming GPT Realtime and Gemini

Loka built a conversational voice agent using Amazon Nova 2 Sonic that achieved 87.0% speech reasoning accuracy on Big Bench Audio, surpassing GPT Realtime at 83.0% and Gemini 2.5 Flash Native Audio at 71.0%. The system delivers Time to First Audio of 1.39 seconds at approximately $0.27 per hour of input audio.

June 24, 2026 · 5:05 PM

June 9, 2026

product update

Google launches Gemini 3.5 Live Translate with continuous speech-to-speech in 70+ languages

Google announced Gemini 3.5 Live Translate, a speech-to-speech translation model supporting over 70 languages with continuous audio generation. The model rolls out today to Google Translate on Android and iOS, with Google Meet integration coming in private preview this month for select Workspace customers.

June 9, 2026 · 3:50 PM

June 8, 2026

product updateAmazon Web Services

AWS releases open-source test harness for evaluating Amazon Nova Sonic voice agents at scale

Amazon has released an open-source testing framework for Nova Sonic voice agents that automates multi-turn conversation evaluation without requiring human testers. The harness uses LLM-as-judge techniques to assess voice agents across six metrics including goal achievement, response accuracy, and tool usage, addressing a critical QA bottleneck in voice AI development.

June 8, 2026 · 4:05 PM

May 13, 2026

product updateAmazon Web Services

AWS Launches WebRTC Integration for Amazon Nova Sonic Real-Time Voice Streaming

AWS has integrated WebRTC protocol support with Amazon Nova Sonic, its speech-to-speech model, through Amazon Kinesis Video Streams. The integration delivers real-time voice streaming with sub-second latency and includes adaptive bitrate control, forward error correction, and Voice Activity Detection for mobile and IoT applications.

May 13, 2026 · 6:05 PM

April 28, 2026

product updateAmazon Web Services

Amazon Nova 2 Sonic Unifies Speech Recognition, Reasoning, and TTS in Single Streaming Model

Amazon Web Services released technical guidance for migrating text agents to voice assistants using Amazon Nova 2 Sonic, a native speech-to-speech model that combines automatic speech recognition, reasoning, tool calling, and text-to-speech in a single bidirectional streaming interface. The model supports asynchronous tool calling and built-in voice activity detection for handling interruptions.

April 28, 2026 · 6:06 PM

← Back to all news