xAI releases Grok 4.20 with 2M context window and native reasoning capabilities
xAI released Grok 4.20 on March 31, 2026, its flagship model featuring a 2 million token context window, $2 per million input tokens and $6 per million output tokens pricing, and toggleable reasoning capabilities. The model includes web search functionality at $5 per 1,000 queries and claims industry-leading speed with low hallucination rates.
Grok 4.20 — Quick Specs
xAI Releases Grok 4.20 With 2M Context and Reasoning Mode
xAI released Grok 4.20 on March 31, 2026. The model features a 2 million token context window, priced at $2 per million input tokens and $6 per million output tokens. Web search functionality costs $5 per 1,000 queries. Knowledge cutoff is September 1, 2025.
Key Specifications
Context and Pricing:
- Context window: 2,000,000 tokens
- Input pricing: $2/M tokens
- Output pricing: $6/M tokens
- Web search: $5 per 1,000 queries
Capabilities:
Grok 4.20 includes native agentic tool-calling abilities and an optional reasoning mode that can be enabled or disabled via API parameter. When enabled, the model exposes its step-by-step reasoning process through a reasoning_details array in the response. Users can preserve reasoning context across multi-turn conversations by passing the complete reasoning details back in subsequent requests.
Performance Claims
xAI claims Grok 4.20 delivers "industry-leading speed" combined with low hallucination rates and "strict prompt adherence." The company positions the model as producing "consistently precise and truthful responses," though specific benchmark scores against competing models have not been disclosed.
Technical Details
The model is accessible through OpenRouter, which normalizes API requests and responses across multiple providers and includes fallback routing to maximize uptime. OpenRouter's documentation indicates support for reasoning-enabled models with access to internal step-by-step thinking before final outputs.
API integration follows standard patterns with optional OpenRouter-specific headers for leaderboard attribution. Third-party SDK support is available through OpenRouter's framework documentation.
What This Means
Grok 4.20 enters a competitive market where context window size has become a commodity feature—Claude 3.5 Sonnet and other models already offer 200K context, and some specialty models exceed 1M. The 2M window is a notable advantage but not unprecedented. The model's value proposition rests on claimed speed advantages and agentic capabilities rather than raw context size alone. The optional reasoning mode, similar to features in newer OpenAI and Anthropic models, allows developers to choose between faster inference and detailed reasoning transparency based on use case requirements. Pricing at $2/$6 positions it at the premium end of the market—significantly higher than open-source alternatives but within range of other flagship models. The lack of disclosed benchmark scores leaves performance claims unverified; independent evaluation will be necessary to validate xAI's claims about hallucination rates and prompt adherence relative to competitors.
Related Articles
DeepSeek Releases V4 Models: 1M Context Window, 90% Less KV Cache Than V3
DeepSeek has released two new MoE models: DeepSeek-V4-Pro with 1.6T parameters (49B activated) and DeepSeek-V4-Flash with 284B parameters (13B activated). Both models support a one million token context window and use a hybrid attention architecture that requires only 27% of single-token inference FLOPs and 10% of KV cache compared to DeepSeek-V3.2.
DeepSeek Releases V4-Pro with 1.6T Parameters, 1M Token Context at 27% Inference Cost of V3
DeepSeek has released two Mixture-of-Experts models: V4-Pro with 1.6 trillion parameters (49B activated) and V4-Flash with 284B parameters (13B activated), both supporting 1 million token context windows. V4-Pro requires only 27% of inference FLOPs and 10% of KV cache compared to V3.2 at 1M token context, trained on over 32 trillion tokens.
OpenAI previews GPT-5.6 to select partners with three variants priced from $1 to $30 per million tokens
OpenAI has begun previewing its GPT-5.6 series to a limited group of trusted partners after government review. The release includes three variants: Sol at $5 input/$30 output per million tokens, Terra at $2.50/$15, and Luna at $1/$6.
OpenAI announces GPT-5.6 series with Sol flagship, Terra at 50% cost of GPT-5.5, and Luna budget model
OpenAI has begun a limited preview of its GPT-5.6 series, introducing three models: Sol (flagship), Terra (2x cheaper than GPT-5.5 with competitive performance), and Luna (lowest cost option). The models are launching first with trusted partners before general availability in coming weeks, following U.S. government preview requirements.
Comments
Loading...