model releaseJetBrains
JetBrains Releases Mellum2-12B Reasoning Model with 131K Context and Mixture-of-Experts Architecture
JetBrains has released Mellum2-12B-A2.5B-Thinking, a reasoning-augmented assistant model with 131,072-token context window and 64 Mixture-of-Experts architecture that activates 8 experts per token. The model emits explicit chain-of-thought reasoning inside <think> blocks before providing final answers.
2 min read