model releaseXiaomi

Xiaomi Launches MiMo-V2.5-Pro with 1M Context Window for Complex Agentic Tasks

TL;DR

Xiaomi released MiMo-V2.5-Pro on April 22, 2026, its flagship model featuring a 1,048,576 token context window and pricing at $1 per million input tokens and $3 per million output tokens. According to Xiaomi, the model ranks highly on ClawEval, GDPVal, and SWE-bench Pro benchmarks, designed for autonomous completion of professional tasks requiring thousands of tool calls.

2 min read
0

Xiaomi Launches MiMo-V2.5-Pro with 1M Context Window for Complex Agentic Tasks

Xiaomi released MiMo-V2.5-Pro on April 22, 2026, its flagship model featuring a 1,048,576 token context window and pricing at $1 per million input tokens and $3 per million output tokens.

Technical Specifications

The model's extended context window — just over 1 million tokens — positions it for integration with agent frameworks requiring long-form context retention. Xiaomi claims the model can autonomously complete professional tasks that would take human experts days or weeks, involving more than a thousand tool calls per task.

Benchmark Performance

According to Xiaomi, MiMo-V2.5-Pro achieves top rankings on:

  • ClawEval: Benchmark for agentic capabilities (specific score not disclosed)
  • GDPVal: General development proficiency evaluation (specific score not disclosed)
  • SWE-bench Pro: Software engineering benchmark (specific score not disclosed)

Xiaomi has not released exact numerical scores for these benchmarks at launch.

Target Use Cases

The company positions MiMo-V2.5-Pro for three primary applications:

  1. General agentic capabilities with extended autonomous operation
  2. Complex software engineering tasks
  3. Long-horizon tasks requiring persistent context across multiple steps

The model is available through OpenRouter, which routes requests to providers with automatic fallbacks for uptime optimization.

Pricing and Availability

MiMo-V2.5-Pro is now available at:

  • Input: $1.00 per million tokens
  • Output: $3.00 per million tokens

This pricing places it in the mid-tier range for flagship models, notably below comparable offerings from Anthropic and OpenAI with similar context windows.

What This Means

Xiaomi's entry with a 1M context window model at competitive pricing adds another option in the expanding agentic AI market. The emphasis on "thousands of tool calls" suggests optimization for complex multi-step workflows rather than single-turn generation. However, without published benchmark scores or independent verification, the claimed performance advantages over existing models remain unconfirmed. The model's actual differentiation will depend on real-world testing in software engineering and agent deployment scenarios.

Related Articles

model release

Nvidia releases Nemotron 3 Ultra: 550B-parameter MoE model with 1M context window for agentic workflows

Nvidia has released Nemotron 3 Ultra, a 550-billion parameter mixture-of-experts model with 55 billion active parameters and support for up to 1 million token context windows. The model uses a hybrid Transformer-Mamba architecture and is designed specifically for long-running agentic workflows including agent orchestration, coding agents, and complex enterprise tasks.

model release

NVIDIA Nemotron 3 Ultra launches on AWS SageMaker with 550B parameters, 1M token context window

NVIDIA Nemotron 3 Ultra is now available on Amazon SageMaker JumpStart with 550 billion total parameters and 55 billion active parameters. The model features a hybrid Transformer-Mamba Mixture-of-Experts architecture and supports context windows up to 1 million tokens, targeting agentic AI workloads.

model release

Ideogram 4: 9.3B parameter open-weight text-to-image model with native 2K resolution and structured JSON prompting

Ideogram has released Ideogram 4, its first open-weight text-to-image model with 9.3 billion parameters. The model supports native 2K resolution, structured JSON prompting with bounding-box layout controls, and is available in nf4 and fp8 quantizations under a non-commercial license.

model release

Ideogram Releases First Open-Weight Image Model With 9.3B Parameters and 2K Native Resolution

Ideogram has released Ideogram 4, a 9.3B parameter open-weight text-to-image model trained from scratch. The model features structured JSON prompting, native 2K resolution output, and ranks as the top open-weight model on Design Arena. Available in fp8 and nf4 quantizations under a non-commercial license.

Comments

Loading...