streaming

8 articles tagged with streaming

June 4, 2026

model releaseNVIDIA

NVIDIA Releases Nemotron 3.5 ASR: 600M-Parameter Streaming Speech Model for 40 Languages

NVIDIA released Nemotron 3.5 ASR, a 600M-parameter speech-to-text model supporting 40 language-locales from a single checkpoint. The model achieves 0.07 seconds to final transcript after speech ends and ranks 2nd in latency among streaming ASR models according to Artificial Analysis benchmarks.

June 4, 2026 · 1:06 PM

May 21, 2026

changelogAnthropic

Anthropic Python SDK v0.104.0 adds thinking token count estimates for streaming responses

Anthropic released version 0.104.0 of its Python SDK on May 21, 2026. The update adds support for a thinking-token-count beta feature that provides estimated token counts in thinking block deltas when streaming responses from reasoning models.

May 21, 2026 · 8:20 PM

May 20, 2026

product updateAmazon Web Services

AWS SageMaker AI adds bidirectional streaming for real-time speech transcription with vLLM

Amazon SageMaker AI has launched bidirectional streaming support for real-time inference, enabling WebSocket-based voice applications through vLLM integration. The feature uses HTTP/2 on port 8443 to bridge client connections with vLLM's Realtime API, allowing audio to stream in while transcription streams back simultaneously over a single persistent connection.

May 20, 2026 · 5:20 PM

April 28, 2026

product updateAmazon Web Services

Amazon Nova 2 Sonic Unifies Speech Recognition, Reasoning, and TTS in Single Streaming Model

Amazon Web Services released technical guidance for migrating text agents to voice assistants using Amazon Nova 2 Sonic, a native speech-to-speech model that combines automatic speech recognition, reasoning, tool calling, and text-to-speech in a single bidirectional streaming interface. The model supports asynchronous tool calling and built-in voice activity detection for handling interruptions.

April 28, 2026 · 6:06 PM

April 9, 2026

product updateOpenAI

ChatGPT now integrates Tubi TV app for searching 300,000+ movies and shows

OpenAI's ChatGPT has integrated Tubi TV, the ad-supported streaming service, allowing users to search Tubi's catalog of over 300,000 movies and TV episodes directly through the AI. Tubi becomes the first streaming service to integrate with ChatGPT's app ecosystem, available across web, desktop, and mobile platforms.

April 9, 2026 · 5:05 PM

April 8, 2026

product updateOpenAI

ChatGPT launches first streaming video app with Tubi for content discovery

OpenAI's ChatGPT has launched its first streaming video service integration, partnering with Tubi. The native app lets users search Tubi's catalog of over 300,000 movies and TV episodes using natural language queries.

April 8, 2026 · 7:35 PM

March 30, 2026

product updateAmazon Web Services

AWS launches agentic AI movie assistant using Nova Sonic 2.0 and Bedrock AgentCore

Amazon Web Services unveiled an agentic AI system for streaming platforms combining Nova Sonic 2.0 (real-time speech model), Bedrock AgentCore, and the Model Context Protocol. The system delivers two core capabilities: context-aware movie recommendations based on mood and viewing history, and real-time scene analysis including actor identification and plot summaries.

March 30, 2026 · 3:35 PM

March 26, 2026

product updateAmazon Web Services

Amazon Polly adds bidirectional streaming API for real-time speech synthesis in conversational AI

Amazon has released a new Bidirectional Streaming API for Amazon Polly that enables simultaneous text input and audio output over a single HTTP/2 connection. The API reduces end-to-end latency by 39% compared to traditional request-response TTS by allowing text to be sent word-by-word as LLMs generate tokens, rather than waiting for complete sentences. The feature is available in Java, JavaScript, .NET, C++, Go, Kotlin, PHP, Ruby, Rust, and Swift SDKs.

March 26, 2026 · 5:20 PM

← Back to all news