Mellum2-12B-A2.5B-Thinking

Name: Mellum2-12B-A2.5B-Thinking
Author: JetBrains

JetBrains CZ

active

Compare with other models →

Context window131K tokens

Version History

v1majorJune 2, 2026

Initial release of Mellum2-12B-A2.5B-Thinking, a reasoning-augmented model trained with supervised fine-tuning and reinforcement learning with verifiable rewards on math-heavy data.

Benchmark Scores

Full leaderboard →

58.4%

AIME 2025

57.6%

GPQA

69.9%

LiveCodeBench

Coverage

model releaseJetBrains

JetBrains Releases Mellum2-12B Reasoning Model with 131K Context and Mixture-of-Experts Architecture

JetBrains has released Mellum2-12B-A2.5B-Thinking, a reasoning-augmented assistant model with 131,072-token context window and 64 Mixture-of-Experts architecture that activates 8 experts per token. The model emits explicit chain-of-thought reasoning inside <think> blocks before providing final answers.

June 2, 2026 · 9:06 AM2 min read

JetBrains Mellum2 reasoning-models