Mellum2-12B-A2.5B-Thinking

active
Context window131K tokens

Version History

v1major

Initial release of Mellum2-12B-A2.5B-Thinking, a reasoning-augmented model trained with supervised fine-tuning and reinforcement learning with verifiable rewards on math-heavy data.

Coverage