MiniMax M3

MiniMax, 🇨🇳 China
Status: active
Context window: 1000K tokens
Input / 1M tokens: $0.3
Output / 1M tokens: $1.2
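
The per-token prices above translate into a per-request cost with simple arithmetic. The following is a minimal sketch, assuming the listed prices are in USD and that billing is linear in token count; the function name `estimate_cost_usd` and the constants are illustrative and not part of any MiniMax API.

```python
# Prices copied from the listing above (assumed USD per 1M tokens).
INPUT_PRICE_PER_M = 0.3
OUTPUT_PRICE_PER_M = 1.2


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request, assuming linear per-token billing."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical request: 200K input tokens, 4K output tokens.
    print(f"${estimate_cost_usd(200_000, 4_000):.4f}")
```

Under these assumptions, a request with 200K input tokens and 4K output tokens comes to about $0.065, and filling the full 1000K-token context costs about $0.30 in input alone.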

Version History

m3 (major)

M3 introduces MiniMax Sparse Attention, which enables a 1M-token context at approximately 1/20th the compute cost of the previous generation. The model is natively multimodal, trained on interleaved data, and tuned with an interactive user simulator.

Coverage