xAI Launches Grok Build 0.1: Coding Model with 256K Context for Agentic Workflows
xAI has released Grok Build 0.1, a coding-specialized model with a 256K context window and unlimited text output. The model is designed for agentic software engineering workflows and powers xAI's Grok Build CLI tool.
Grok Build 0.1 — Quick Specs
xAI Launches Grok Build 0.1: Coding Model with 256K Context for Agentic Workflows
xAI has released Grok Build 0.1, a coding-specialized language model with a 256K token context window and no text output limit, according to the company.
Model Specifications
Grok Build 0.1 accepts both text and image inputs while generating text output. The model is currently in early access and available through OpenRouter's API platform.
Key specifications:
- Context window: 256K tokens
- Output limit: None
- Modalities: Text and image input, text output
- Release date: May 20, 2025 (according to OpenRouter listing)
- Pricing: Not yet disclosed
Purpose-Built for Coding Agents
xAI claims the model is "trained specifically for agentic software engineering workflows." The company positions it as optimized for:
- Interactive coding agents
- Tool use and function calling
- Multi-step development tasks
- Long-horizon coding projects
- Automation workflows
The model powers xAI's Grok Build CLI, a command-line interface tool for developers.
Technical Context
The 256K context window places Grok Build 0.1 in the upper tier of commercially available models, matching capabilities from Anthropic's Claude 3 series and Google's Gemini 1.5 models. The unlimited output generation is notable for coding use cases where generating complete files or large code blocks is common.
xAI has not disclosed benchmark scores, parameter count, or training data details. The company also has not specified whether this model is a variant of its existing Grok-2 architecture or represents a new model family.
What This Means
Grok Build 0.1 represents xAI's first model explicitly targeting the coding assistant and agentic development market, competing directly with OpenAI's o1 models, Anthropic's Claude family, and Google's Gemini for Developers. The emphasis on "agentic workflows" and CLI integration suggests xAI is pursuing the emerging category of autonomous coding agents rather than traditional code completion. Without pricing information or benchmark data, it remains unclear how Grok Build 0.1 compares to existing alternatives in performance or cost-effectiveness. The early access designation indicates limited availability as xAI likely refines the model based on developer feedback.
Related Articles
Google launches Gemini 3.1 Flash Lite Image with 4-second generation time, $0.25 per 1M input tokens
Google has released Gemini 3.1 Flash Lite Image, a text-to-image model that generates 1K resolution images in approximately 4 seconds — 2.7× faster than Gemini 3.1 Flash Image. The model is priced at $0.25 per 1M input tokens and $1.50 per 1M output tokens, with a 66K context window and knowledge cutoff of January 2025.
Mistral releases Leanstral 1.5: 119B parameter open-source model for Lean 4 proof assistance
Mistral AI has released Leanstral 1.5, an open-source 119B parameter mixture-of-experts model designed specifically for Lean 4 proof assistance. The model features 128 experts with 4 active per token (6.5B activated parameters), a 256k token context window, and multimodal input capabilities.
NVIDIA releases Nemotron-Labs-TwoTower-30B: block-wise diffusion model claims 2.42× faster generation at 98.7% baseline
NVIDIA released Nemotron-Labs-TwoTower-30B-A3B-Base-BF16, a block-wise diffusion language model that generates text by denoising blocks of tokens in parallel rather than sequentially. According to NVIDIA, the model achieves 2.42× the wall-clock generation throughput of its autoregressive baseline while retaining 98.7% of aggregate benchmark quality.
Mistral Releases Leanstral 1.5: 6B-Parameter Model Achieves 100% on miniF2F, Solves 587/672 PutnamBench Problems
Mistral AI released Leanstral 1.5, a free Apache-2.0 licensed model with 119B total parameters and 6B active parameters specialized for formal verification in Lean 4. The model achieves 100% on miniF2F benchmark, solves 587 of 672 PutnamBench problems at $4 per problem (versus $300+ for competitors), and reaches state-of-the-art 87% on FATE-H and 34% on FATE-X benchmarks.
Comments
Loading...