OpenAI launches GPT-Realtime-2 with GPT-5-class reasoning, adds real-time translation across 70 languages
OpenAI has added three voice intelligence features to its Realtime API: GPT-Realtime-2 with GPT-5-class reasoning for complex conversational requests, GPT-Realtime-Translate supporting 70 input languages and 13 output languages, and GPT-Realtime-Whisper for live speech-to-text transcription. Translation and transcription are billed by the minute, while GPT-Realtime-2 uses token-based pricing.
OpenAI has released three new voice intelligence features in its Realtime API: GPT-Realtime-2, a voice model with GPT-5-class reasoning; GPT-Realtime-Translate for real-time conversational translation; and GPT-Realtime-Whisper for live transcription.
GPT-Realtime-2: Voice with advanced reasoning
GPT-Realtime-2 succeeds GPT-Realtime-1.5 and includes what OpenAI describes as "GPT-5-class reasoning" designed to handle complex user requests during voice conversations. The model generates realistic synthetic speech and can reason through a request while it converses with the user. Pricing is token-based, though specific rates were not disclosed.
GPT-Realtime-Translate: 70 input languages, 13 output languages
The translation feature supports 70 input languages (languages it can understand) and 13 output languages (languages it can speak). According to OpenAI, the system provides real-time translation that "keeps pace" with conversational flow. The feature is billed by the minute, with pricing not yet disclosed.
GPT-Realtime-Whisper: Live transcription
GPT-Realtime-Whisper adds live speech-to-text capabilities, capturing transcriptions as conversations occur. Like the translation feature, it is billed by the minute.
Target use cases and safeguards
OpenAI positions these features for customer service, education, media, events, and creator platforms. The company stated it has implemented guardrails to prevent misuse for spam, fraud, or abuse. Conversations can be automatically halted if they violate OpenAI's harmful content guidelines, though the company did not specify how these triggers operate.
According to OpenAI, the new models move real-time audio "from simple call-and-response toward voice interfaces that can actually do work: listen, reason, translate, transcribe, and take action as a conversation unfolds."
What this means
The addition of GPT-5-class reasoning to voice models marks a capability upgrade over the previous generation, though OpenAI has not clarified what "GPT-5-class" means in terms of benchmark performance. Support for 70 input languages is substantial for multilingual applications, but the limit of 13 output languages means many users will be understood by the system without receiving responses in their native language. The per-minute billing for translation and transcription differs from the token-based model used for GPT-Realtime-2, which may affect cost predictability for developers building conversational applications.
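To illustrate why the two billing models differ in predictability, here is a minimal Python sketch. Every number in it is a hypothetical placeholder, since the rates were not disclosed: the per-minute rate, the per-token rate, and the assumed number of audio tokens per minute are all assumptions, as are the function names.

```python
# Hypothetical values for illustration only; OpenAI's actual rates
# were not disclosed in the announcement covered here.
PER_MINUTE_RATE_USD = 0.02       # assumed per-minute rate
PER_MILLION_TOKENS_USD = 40.0    # assumed price per 1M audio tokens
TOKENS_PER_MINUTE = 600          # assumed audio tokens consumed per minute


def per_minute_cost(minutes: float) -> float:
    """Per-minute billing: cost depends only on call duration."""
    return minutes * PER_MINUTE_RATE_USD


def token_cost(minutes: float, tokens_per_minute: float = TOKENS_PER_MINUTE) -> float:
    """Token billing: cost depends on how many tokens the call consumes."""
    return minutes * tokens_per_minute * PER_MILLION_TOKENS_USD / 1_000_000


if __name__ == "__main__":
    for minutes in (1, 10, 60):
        print(
            f"{minutes:>3} min  per-minute: ${per_minute_cost(minutes):.2f}  "
            f"token-based: ${token_cost(minutes):.2f}"
        )
```

Under these assumptions, the per-minute cost can be computed from call length alone, while the token-based cost also moves with the token rate, which a developer may not know in advance. Doubling `tokens_per_minute` doubles the token-based cost for a call of the same length.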