Xiaomi Launches MiMo-V2.5-Pro with 1M Context Window for Complex Agentic Tasks
Xiaomi released MiMo-V2.5-Pro on April 22, 2026, its flagship model featuring a 1,048,576 token context window and pricing at $1 per million input tokens and $3 per million output tokens. According to Xiaomi, the model ranks highly on ClawEval, GDPVal, and SWE-bench Pro benchmarks, designed for autonomous completion of professional tasks requiring thousands of tool calls.
MiMo-V2.5-Pro — Quick Specs
Xiaomi Launches MiMo-V2.5-Pro with 1M Context Window for Complex Agentic Tasks
Xiaomi released MiMo-V2.5-Pro on April 22, 2026, its flagship model featuring a 1,048,576 token context window and pricing at $1 per million input tokens and $3 per million output tokens.
Technical Specifications
The model's extended context window — just over 1 million tokens — positions it for integration with agent frameworks requiring long-form context retention. Xiaomi claims the model can autonomously complete professional tasks that would take human experts days or weeks, involving more than a thousand tool calls per task.
Benchmark Performance
According to Xiaomi, MiMo-V2.5-Pro achieves top rankings on:
- ClawEval: Benchmark for agentic capabilities (specific score not disclosed)
- GDPVal: General development proficiency evaluation (specific score not disclosed)
- SWE-bench Pro: Software engineering benchmark (specific score not disclosed)
Xiaomi has not released exact numerical scores for these benchmarks at launch.
Target Use Cases
The company positions MiMo-V2.5-Pro for three primary applications:
- General agentic capabilities with extended autonomous operation
- Complex software engineering tasks
- Long-horizon tasks requiring persistent context across multiple steps
The model is available through OpenRouter, which routes requests to providers with automatic fallbacks for uptime optimization.
Pricing and Availability
MiMo-V2.5-Pro is now available at:
- Input: $1.00 per million tokens
- Output: $3.00 per million tokens
This pricing places it in the mid-tier range for flagship models, notably below comparable offerings from Anthropic and OpenAI with similar context windows.
What This Means
Xiaomi's entry with a 1M context window model at competitive pricing adds another option in the expanding agentic AI market. The emphasis on "thousands of tool calls" suggests optimization for complex multi-step workflows rather than single-turn generation. However, without published benchmark scores or independent verification, the claimed performance advantages over existing models remain unconfirmed. The model's actual differentiation will depend on real-world testing in software engineering and agent deployment scenarios.
Related Articles
Arcee AI Releases Trinity Large Preview: 400B-Parameter MoE Model with 512K Context Window
Arcee AI has released Trinity Large Preview, a 400B-parameter sparse Mixture-of-Experts model with 13B active parameters per token using 4-of-256 expert routing. The model supports context windows up to 512K tokens and is available with open weights under permissive licensing.
Anthropic releases Claude Opus 4.7 with improved coding and vision, confirms it trails unreleased Mythos model
Anthropic released Claude Opus 4.7 with improved coding capabilities, higher-resolution vision, and a new reasoning level. The company publicly acknowledged the model underperforms its unreleased Mythos system, which remains restricted due to safety concerns.
Xiaomi Launches MiMo-V2.5 With 1M Context Window at $0.40 per Million Input Tokens
Xiaomi released MiMo-V2.5 on April 22, 2026, a native omnimodal model with a 1,048,576 token context window. The model is priced at $0.40 per million input tokens and $2 per million output tokens, positioning it as a cost-efficient alternative for agentic applications requiring multimodal perception across image and video understanding.
OpenAI Releases GPT-5.4 Image 2 with 272K Context Window and Image Generation
OpenAI has released GPT-5.4 Image 2, combining the GPT-5.4 reasoning model with image generation capabilities. The multimodal model features a 272K token context window and is priced at $8 per million input tokens and $15 per million output tokens.
Comments
Loading...