Grok 4.20

xAI🇺🇸 United States

active

Compare with other models →

Context window2000K tokens

Input / 1M tokens$2

Output / 1M tokens$6

Version History

4.20majorMarch 31, 2026

Grok 4.20 is xAI's flagship release featuring a 2 million token context window, toggleable reasoning capabilities, and native agentic tool support. The model claims industry-leading speed with low hallucination rates.

Benchmark Scores

Full leaderboard →

96.7%

AIME 2024

97.0%

AIME 2025

96.9%

DocVQA

95.5%

GPQA

98.2%

HumanEval

85.0%

LiveCodeBench

98.8%

MATH

95.2%

MMLU

92.8%

MMLU-Pro

86.2%

MMMU

215.0 tokens_per_sec

Speed (tok/s)

84.2%

SWE-bench Verified

Coverage

model releasexAI

xAI releases Grok 4.20 with 2M context window and native reasoning capabilities

xAI released Grok 4.20 on March 31, 2026, its flagship model featuring a 2 million token context window, $2 per million input tokens and $6 per million output tokens pricing, and toggleable reasoning capabilities. The model includes web search functionality at $5 per 1,000 queries and claims industry-leading speed with low hallucination rates.

March 31, 2026 · 7:20 PM2 min read

grok xai model-release

benchmarkxAI

Grok 4.20 trails GPT-5.4 and Gemini 3.1 but achieves record 78% non-hallucination rate

xAI's Grok 4.20 scores 48 on Artificial Analysis' Intelligence Index—6 points ahead of Grok 4 but trailing Gemini 3.1 Pro Preview and GPT-5.4 at 57. The model distinguishes itself with a 78% non-hallucination rate on the AA Omniscience test, the highest recorded across any model tested.

March 14, 2026 · 6:38 PM2 min read

Grok xAI benchmark