Grok 4.20

xAI🇺🇸 United States
active
Context window2000K tokens
Input / 1M tokens$2
Output / 1M tokens$6

Version History

4.20major

Grok 4.20 is xAI's flagship release featuring a 2 million token context window, toggleable reasoning capabilities, and native agentic tool support. The model claims industry-leading speed with low hallucination rates.

Benchmark Scores

Full leaderboard →
96.7%
AIME 2024
97.0%
AIME 2025
96.9%
DocVQA
95.5%
GPQA
98.2%
HumanEval
85.0%
LiveCodeBench
98.8%
MATH
95.2%
MMLU
92.8%
MMLU-Pro
86.2%
MMMU
24.0 tokens_per_sec
Speed (tok/s)
84.2%
SWE-bench Verified

Coverage