LLM News

Every LLM release, update, and milestone.

Filtered by:Grok✕ clear

benchmarkxAI

Grok 4.20 trails GPT-5.4 and Gemini 3.1 but achieves record 78% non-hallucination rate

xAI's Grok 4.20 scores 48 on Artificial Analysis' Intelligence Index—6 points ahead of Grok 4 but trailing Gemini 3.1 Pro Preview and GPT-5.4 at 57. The model distinguishes itself with a 78% non-hallucination rate on the AA Omniscience test, the highest recorded across any model tested.

March 14, 2026 · 6:38 PM2 min read

Grok xAI benchmark

via the-decoder.com ↗