Xiaomi Launches MiMo-V2.5 With 1M Context Window at $0.40 per Million Input Tokens
Xiaomi released MiMo-V2.5 on April 22, 2026, a native omnimodal model with a 1,048,576 token context window. The model is priced at $0.40 per million input tokens and $2 per million output tokens, positioning it as a cost-efficient alternative for agentic applications requiring multimodal perception across image and video understanding.
MiMo-V2.5 — Quick Specs
Xiaomi Launches MiMo-V2.5 With 1M Context Window at $0.40 per Million Input Tokens
Xiaomi released MiMo-V2.5 on April 22, 2026, a native omnimodal model featuring a 1,048,576 token context window priced at $0.40 per million input tokens and $2 per million output tokens.
Specifications and Pricing
MiMo-V2.5 offers:
- Context window: 1,048,576 tokens (1M)
- Input pricing: $0.40 per million tokens
- Output pricing: $2 per million tokens
- Release date: April 22, 2026
According to Xiaomi, the model delivers "Pro-level agentic performance at roughly half the inference cost" compared to unspecified alternatives, though the company has not provided independent benchmark scores to verify these claims.
Technical Capabilities
Xiaomi describes MiMo-V2.5 as a "native omnimodal model" designed for multimodal perception across image and video understanding tasks. The company claims the model surpasses its predecessor, MiMo-V2-Omni, in multimodal perception, though specific benchmark comparisons were not disclosed.
The 1M context window is designed to handle complete documents, extended conversations, and complex task contexts in a single inference pass. Xiaomi positions this capability as particularly suited for integration with agent frameworks.
Availability
The model is currently available through OpenRouter, which routes requests across multiple providers to optimize uptime and handle varying prompt sizes. OpenRouter supports the model's reasoning capabilities through a dedicated reasoning parameter that exposes step-by-step thinking processes via a reasoning_details array in API responses.
What This Means
MiMo-V2.5 enters an increasingly competitive omnimodal model market with a clear value proposition: extended context at lower input pricing than many enterprise-tier alternatives. At $0.40 per million input tokens, it undercuts several comparable models while offering a 1M context window—a specification typically reserved for premium tiers.
The focus on agentic workflows suggests Xiaomi is targeting developers building autonomous systems that require sustained reasoning across multimodal inputs. However, without published benchmark scores on standard evaluation sets like MMLU, VQAv2, or video understanding benchmarks, independent assessment of the model's claimed performance advantages remains difficult. The model's effectiveness will ultimately be determined by real-world deployment results in production agent systems.
Related Articles
Alibaba Qwen Releases 27B Parameter Model with 262K Context Window, Claims 77.2% on SWE-bench Verified
Alibaba Qwen released Qwen3.6-27B, a 27-billion parameter model with a 262,144 token context window extensible to 1,010,000 tokens. The model claims 77.2% on SWE-bench Verified and 53.5% on SWE-bench Pro, with open weights available on Hugging Face.
OpenAI Releases GPT-5.4 Image 2 with 272K Context Window and Image Generation
OpenAI has released GPT-5.4 Image 2, combining the GPT-5.4 reasoning model with image generation capabilities. The multimodal model features a 272K token context window and is priced at $8 per million input tokens and $15 per million output tokens.
Xiaomi Launches MiMo-V2.5-Pro with 1M Context Window for Complex Agentic Tasks
Xiaomi released MiMo-V2.5-Pro on April 22, 2026, its flagship model featuring a 1,048,576 token context window and pricing at $1 per million input tokens and $3 per million output tokens. According to Xiaomi, the model ranks highly on ClawEval, GDPVal, and SWE-bench Pro benchmarks, designed for autonomous completion of professional tasks requiring thousands of tool calls.
OpenAI releases ChatGPT Images 2.0 with 3840x2160 resolution at $30 per 1M output tokens
OpenAI released ChatGPT Images 2.0, pricing output tokens at $30 per million with maximum resolution of 3840x2160 pixels. CEO Sam Altman claims the improvement from gpt-image-1 to gpt-image-2 equals the jump from GPT-3 to GPT-5.
Comments
Loading...