streaming
5 articles tagged with streaming
Amazon Nova 2 Sonic Unifies Speech Recognition, Reasoning, and TTS in Single Streaming Model
Amazon Web Services released technical guidance for migrating text agents to voice assistants using Amazon Nova 2 Sonic, a native speech-to-speech model that combines automatic speech recognition, reasoning, tool calling, and text-to-speech in a single bidirectional streaming interface. The model supports asynchronous tool calling and built-in voice activity detection for handling interruptions.
ChatGPT now integrates Tubi TV app for searching 300,000+ movies and shows
OpenAI's ChatGPT has integrated Tubi TV, the ad-supported streaming service, allowing users to search Tubi's catalog of over 300,000 movies and TV episodes directly through the AI. Tubi becomes the first streaming service to integrate with ChatGPT's app ecosystem, available across web, desktop, and mobile platforms.
ChatGPT launches first streaming video app with Tubi for content discovery
OpenAI's ChatGPT has launched its first streaming video service integration, partnering with Tubi. The native app lets users search Tubi's catalog of over 300,000 movies and TV episodes using natural language queries.
AWS launches agentic AI movie assistant using Nova Sonic 2.0 and Bedrock AgentCore
Amazon Web Services unveiled an agentic AI system for streaming platforms combining Nova Sonic 2.0 (real-time speech model), Bedrock AgentCore, and the Model Context Protocol. The system delivers two core capabilities: context-aware movie recommendations based on mood and viewing history, and real-time scene analysis including actor identification and plot summaries.
Amazon Polly adds bidirectional streaming API for real-time speech synthesis in conversational AI
Amazon has released a new Bidirectional Streaming API for Amazon Polly that enables simultaneous text input and audio output over a single HTTP/2 connection. The API reduces end-to-end latency by 39% compared to traditional request-response TTS by allowing text to be sent word-by-word as LLMs generate tokens, rather than waiting for complete sentences. The feature is available in Java, JavaScript, .NET, C++, Go, Kotlin, PHP, Ruby, Rust, and Swift SDKs.