voice-ai

8 articles tagged with voice-ai

June 18, 2026

Mistral AI adds Deep Research agent, voice mode with Voxtral model to Le Chat

Mistral AI has released a major update to Le Chat, adding a Deep Research agent that generates structured research reports, a new voice input model called Voxtral, and Projects for organizing conversations. The update also includes multilingual reasoning powered by Mistral's Magistral model.

June 18, 2026 · 8:51 AM

May 18, 2026

product updateAmazon Web Services

Amazon launches AI-generated podcast feature in Alexa+ with real-time news from AP, Reuters, Washington Post

Amazon launched Alexa Podcasts today in the U.S., a feature that generates custom podcast episodes on demand using AI-generated host voices. The company claims partnerships with the Associated Press, Reuters, The Washington Post, and over 200 local newspapers to improve content accuracy.

May 18, 2026 · 3:05 PM

April 20, 2026

product updateAmazon Web Services

AWS Releases AgentCore Platform for Building Voice AI Agents with Nova 2 Sonic Integration

Amazon has released Bedrock AgentCore, a platform for building and deploying AI agents with microVM isolation for secure session handling. The platform integrates with Amazon Nova 2 Sonic, a speech-to-speech foundation model, and supports the Model Context Protocol (MCP) for connecting agents to backend services.

April 20, 2026 · 3:21 PM

March 30, 2026

product updateAmazon Web Services

AWS launches agentic AI movie assistant using Nova Sonic 2.0 and Bedrock AgentCore

Amazon Web Services unveiled an agentic AI system for streaming platforms combining Nova Sonic 2.0 (real-time speech model), Bedrock AgentCore, and the Model Context Protocol. The system delivers two core capabilities: context-aware movie recommendations based on mood and viewing history, and real-time scene analysis including actor identification and plot summaries.

March 30, 2026 · 3:35 PM

March 26, 2026

model release

Gemini 3.1 Flash Live scores 95.9% on Big Bench Audio, Google's fastest voice model

Google has released Gemini 3.1 Flash Live, its new voice and audio AI model, scoring 95.9% on the Big Bench Audio Benchmark at high thinking levels—second only to Step-Audio R1.1 Realtime at 97.0%. Response times range from 0.96 seconds at minimal thinking to 2.98 seconds at high thinking, with pricing held at $0.35 per hour of audio input and $1.40 per hour of audio output.

March 26, 2026 · 5:50 PM

model release

Google releases Gemini 3.1 Flash Live, claims improved audio recognition and lower latency for voice conversations

Google announced Gemini 3.1 Flash Live as its updated audio and voice model for Gemini Live and Search Live. The model claims improved acoustic recognition, better background noise filtering, support for over 90 languages, and lower latency compared to 2.5 Flash Native Audio.

March 26, 2026 · 4:20 PM

model release

Google releases Gemini 3.1 Flash Live, its highest-quality audio model for real-time voice AI

Google has released Gemini 3.1 Flash Live, its highest-quality audio and voice model designed for real-time dialogue. The model scores 90.8% on ComplexFuncBench Audio and 36.1% on Scale AI's Audio MultiChallenge with reasoning enabled, with improved tonal understanding and lower latency compared to previous versions.

March 26, 2026 · 3:35 PM

model release

Mistral releases Voxtral TTS, open-source speech model for enterprise voice agents

Mistral AI released Voxtral TTS, an open-source text-to-speech model designed for enterprise voice agents and edge devices. The model supports nine languages, adapts custom voices from samples under five seconds, and achieves 90ms time-to-first-audio latency with a 6x real-time factor.

March 26, 2026 · 11:35 AM

← Back to all news