Xiaomi releases MiMo-V2-Pro with 1M context window and 1T+ parameters
Xiaomi released MiMo-V2-Pro on March 18, 2026, a flagship foundation model with over 1 trillion total parameters and a 1,048,576 token context window. The model is priced at $1 per million input tokens and $3 per million output tokens, positioning it as an agent-focused system comparable to top-tier models.
Xiaomi MiMo-V2-Pro — Quick Specs
Xiaomi Releases MiMo-V2-Pro With 1M Context Window and 1T+ Parameters
Xiaomi released MiMo-V2-Pro on March 18, 2026, a foundation model featuring over 1 trillion total parameters and a 1,048,576 token context window. The model is priced at $1 per million input tokens and $3 per million output tokens.
Technical Specifications
MiMo-V2-Pro is positioned as Xiaomi's flagship foundation model, designed primarily for agentic scenarios and complex workflow orchestration. The model features:
- Context window: 1,048,576 tokens (1M)
- Parameter count: Over 1 trillion total parameters
- Input pricing: $1 per million tokens
- Output pricing: $3 per million tokens
- Release date: March 18, 2026
Benchmarks and Performance Claims
According to Xiaomi, MiMo-V2-Pro "ranks among the global top tier in the standard PinchBench and ClawBench benchmarks, with perceived performance approaching that of Opus 4.6." The company does not publish specific benchmark scores, instead relying on comparative claims against existing models.
OpenRouter's usage data shows the model handling 319 billion prompt tokens, 1.03 billion completion tokens, and 476 million reasoning tokens in recent tracking periods.
Agentic Focus and Integration
MiMo-V2-Pro is explicitly optimized for agent frameworks, with Xiaomi citing OpenClaw compatibility as a key feature. The company describes the model as "designed to serve as the brain of agent systems, orchestrating complex workflows, driving production engineering tasks, and delivering results reliably."
The model supports reasoning-enabled inference through OpenRouter, allowing access to step-by-step thinking processes via the reasoning parameter in API requests.
Availability and Distribution
MiMo-V2-Pro is accessible through OpenRouter, which handles provider routing and fallback mechanisms to maximize uptime. OpenRouter normalizes API requests and responses across multiple provider implementations.
What This Means
Xiaomi enters the large foundation model market with competitive context window sizing (matching or exceeding most current offerings) and aggressive pricing on the output tier ($3/M output tokens). The focus on agent-optimized design suggests Xiaomi is targeting enterprise automation workflows rather than general-purpose chat applications. The 1M context window places MiMo-V2-Pro in the extended-context category used by models handling document analysis and multi-turn agent reasoning, though specific benchmark data remains unavailable for direct performance comparison. Xiaomi's entry signals continued fragmentation in the foundation model market, with regional players (Chinese tech companies like Xiaomi, Alibaba/Qwen, Baidu, Tencent) establishing independent model lines alongside US-based competitors.
Related Articles
Xiaomi launches MiMo-V2-Pro with 1T parameters, matches Claude Opus on coding at 80% lower cost
Xiaomi shipped three AI models simultaneously designed to form a complete agent platform. MiMo-V2-Pro, a 1-trillion-parameter Mixture-of-Experts model with 42 billion active parameters per request, scores 78% on SWE-bench Verified and 81 points on ClawEval—nearly matching Claude Opus 4.6 while costing $1 per million input tokens versus $5 for Opus.
Cursor releases Composer 2 at $0.50/$2.50 per 1M tokens, undercutting Claude and GPT-4 on pricing
Cursor released Composer 2, a code-specialized model priced at $0.50 per million input tokens and $2.50 per million output tokens—roughly 90% cheaper than Claude Opus 4.6 ($5.00/$25.00) and 60% cheaper than GPT-5.4 ($2.50/$15.00). The model scores 61.3 on Cursor's internal CursorBench, competitive with Claude Opus 4.6 (58.2) but below GPT-5.4 Thinking (63.9).
OpenAI releases GPT-4o mini with 128K context at $0.15/$0.60 per 1M tokens
OpenAI released GPT-4o mini on July 18, 2024, a compact multimodal model with 128,000 token context window priced at $0.15 per million input tokens and $0.60 per million output tokens. The model achieves 82% on MMLU and claims to rank higher than GPT-4 on chat preference leaderboards while costing 60% less than GPT-3.5 Turbo.
OpenAI releases GPT-5.4 mini and nano with 3-4x price increases but major performance gains
OpenAI has released GPT-5.4 mini and GPT-5.4 nano, compact models optimized for coding and subagent tasks. The new models deliver significant performance improvements—GPT-5.4 mini reaches 54.4% on SWE-Bench Pro versus 45.7% for GPT-5 mini—but cost 3-4x more per input token than their predecessors.