Grok 3
xAI🇺🇸 United States
xAI flagship with real-time X/Twitter data access and strong reasoning.
Context window131K tokens
Input / 1M tokens$3
Output / 1M tokens$15
Version History
grok-3-2025-10minor
Grok 3 October update extending knowledge cutoff and improving accuracy on real-time query grounding from X data.
grok-3-betamajor
Grok 3 beta. xAI claims #1 on AIME, GPQA, and LiveCodeBench at launch.
Benchmark Scores
Full leaderboard →83.9%
AIME 2024
82.7%
AIME 2025
75.3%
GPQA
91.2%
HumanEval
58.4%
LiveCodeBench
97.6%
MATH
87.5%
MMLU
78.1%
MMLU-Pro
55.0 tokens_per_sec
Speed (tok/s)
49.5%
SWE-bench Verified