model releaseXiaomi

Xiaomi releases MiMo-V2-Pro with 1M context window and 1T+ parameters

TL;DR

Xiaomi released MiMo-V2-Pro on March 18, 2026, a flagship foundation model with over 1 trillion total parameters and a 1,048,576 token context window. The model is priced at $1 per million input tokens and $3 per million output tokens, positioning it as an agent-focused system comparable to top-tier models.

2 min read
0

Xiaomi MiMo-V2-Pro — Quick Specs

Context window1049K tokens
Input$1/1M tokens
Output$3/1M tokens

Xiaomi Releases MiMo-V2-Pro With 1M Context Window and 1T+ Parameters

Xiaomi released MiMo-V2-Pro on March 18, 2026, a foundation model featuring over 1 trillion total parameters and a 1,048,576 token context window. The model is priced at $1 per million input tokens and $3 per million output tokens.

Technical Specifications

MiMo-V2-Pro is positioned as Xiaomi's flagship foundation model, designed primarily for agentic scenarios and complex workflow orchestration. The model features:

  • Context window: 1,048,576 tokens (1M)
  • Parameter count: Over 1 trillion total parameters
  • Input pricing: $1 per million tokens
  • Output pricing: $3 per million tokens
  • Release date: March 18, 2026

Benchmarks and Performance Claims

According to Xiaomi, MiMo-V2-Pro "ranks among the global top tier in the standard PinchBench and ClawBench benchmarks, with perceived performance approaching that of Opus 4.6." The company does not publish specific benchmark scores, instead relying on comparative claims against existing models.

OpenRouter's usage data shows the model handling 319 billion prompt tokens, 1.03 billion completion tokens, and 476 million reasoning tokens in recent tracking periods.

Agentic Focus and Integration

MiMo-V2-Pro is explicitly optimized for agent frameworks, with Xiaomi citing OpenClaw compatibility as a key feature. The company describes the model as "designed to serve as the brain of agent systems, orchestrating complex workflows, driving production engineering tasks, and delivering results reliably."

The model supports reasoning-enabled inference through OpenRouter, allowing access to step-by-step thinking processes via the reasoning parameter in API requests.

Availability and Distribution

MiMo-V2-Pro is accessible through OpenRouter, which handles provider routing and fallback mechanisms to maximize uptime. OpenRouter normalizes API requests and responses across multiple provider implementations.

What This Means

Xiaomi enters the large foundation model market with competitive context window sizing (matching or exceeding most current offerings) and aggressive pricing on the output tier ($3/M output tokens). The focus on agent-optimized design suggests Xiaomi is targeting enterprise automation workflows rather than general-purpose chat applications. The 1M context window places MiMo-V2-Pro in the extended-context category used by models handling document analysis and multi-turn agent reasoning, though specific benchmark data remains unavailable for direct performance comparison. Xiaomi's entry signals continued fragmentation in the foundation model market, with regional players (Chinese tech companies like Xiaomi, Alibaba/Qwen, Baidu, Tencent) establishing independent model lines alongside US-based competitors.

Related Articles

model release

Mistral Releases Voxtral TTS: 4B Parameter Text-to-Speech Model at $0.016 per 1k Characters

Mistral AI has released Voxtral TTS, a 4B parameter text-to-speech model supporting 9 languages including English, French, German, Spanish, Dutch, Portuguese, Italian, Hindi, and Arabic. The model achieves 70ms latency for typical inputs and can clone voices from as little as 3 seconds of audio, priced at $0.016 per 1,000 characters.

model release

Mistral OCR 3 launches at $2 per 1,000 pages with 74% win rate over previous version

Mistral AI released Mistral OCR 3, a document extraction model priced at $2 per 1,000 pages ($1 with Batch API discount). The model achieves a 74% overall win rate over its predecessor on forms, scanned documents, complex tables, and handwriting according to internal benchmarks.

model release

Mistral Releases Mistral 3 Family: 675B-Parameter Large 3 MoE and Three Edge Models Under Apache 2.0

Mistral has released Mistral 3, including Mistral Large 3—a sparse mixture-of-experts model with 41B active and 675B total parameters—and three Ministral 3 edge models (3B, 8B, 14B). All models are released under Apache 2.0 license with multimodal capabilities and are available today on multiple platforms.

model release

Google releases Gemini 3.1 Flash Image, claims Pro-level quality at $0.50 per 1M tokens

Google has released Gemini 3.1 Flash Image, internally codenamed "Nano Banana 2," an image generation and editing model with a 131K context window. The model is priced at $0.50 per 1M input tokens and $3 per 1M output tokens.

Comments

Loading...