accuracy

3 articles tagged with accuracy

May 5, 2026

changelogOpenAI

OpenAI releases GPT-5.5 Instant as default ChatGPT model with 52.5% fewer hallucinations

OpenAI released GPT-5.5 Instant as the new default ChatGPT model on May 5, 2026. The company claims the update produces 52.5% fewer hallucinations on high-stakes prompts and 37.3% fewer inaccurate claims on challenging conversations compared to GPT-5.3 Instant.

May 5, 2026 · 5:35 PM

April 7, 2026

benchmark

Google AI Overviews reach 91% accuracy with Gemini 3, but 56% of answers lack verifiable sources

An independent study by AI startup Oumi found that Google's AI Overviews answered correctly 91% of the time with Gemini 3, up from 85% with Gemini 2, based on 4,326 searches using the SimpleQA benchmark. However, 56% of correct answers in Gemini 3 could not be verified through the linked sources—a significant increase from 37% in Gemini 2—and at Google's scale, a 9% error rate still translates to millions of wrong answers per hour.

April 7, 2026 · 7:05 PM

February 28, 2026

benchmarkOpenAI

Frontier LLMs lose up to 33% accuracy in long conversations, study finds

Frontier language models including GPT-5.2 and Claude 4.6 experience accuracy degradation of up to 33% as conversations lengthen, according to new research. The finding suggests that extended context use within a single conversation introduces performance challenges even in state-of-the-art models.

February 28, 2026 · 6:05 PM

← Back to all news