Xiaomi
Chinese tech giant expanding into AI with the MiMo series of foundation models, optimized for agentic and multimodal scenarios.
https://www.mi.com →News
No articles yet.
Models
MiMo-V2-Omni
Xiaomi
Xiaomi's frontier omni-modal model that natively processes image, video, and audio inputs within a unified architecture. Combines strong multimodal perception with agentic capability including visual grounding, multi-step planning, tool use, and code execution.
Context262K
Input/1M$0.4
Mar 18, 2026
Open weightsMiMo-V2-Pro
Xiaomi
Xiaomi's flagship foundation model with over 1 trillion parameters and 1M context length. Deeply optimized for agentic scenarios. Ranks among global top tier on standard benchmarks, approaching Opus 4.6 levels.
Context1049K
Input/1M$1
Mar 18, 2026
Open weights