Granite 4.1 8B

Context window: 512K tokens

Version History

4.1 (major)

Granite 4.1 introduces an 8B dense model that matches the performance of the previous 32B MoE, trained on 15T tokens with a five-phase pipeline and extended to a 512K context.

Coverage

Model release · IBM

IBM's Granite 4.1: 8B Dense Model Matches 32B MoE Performance on 15T Tokens

IBM released Granite 4.1, a family of dense decoder-only LLMs (3B, 8B, 30B parameters) trained on approximately 15 trillion tokens using a five-phase pre-training pipeline. The 8B instruct model matches or surpasses the previous Granite 4.0-H-Small (a 32B-A9B MoE) despite using fewer parameters and a simpler dense architecture. All models support context windows of up to 512K tokens and are released under the Apache 2.0 license.
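The headline figures (8B dense parameters, ~15T training tokens) can be turned into a rough training-compute estimate with the widely used 6·N·D FLOPs heuristic. This is a back-of-the-envelope sketch, not a figure IBM has published:

```python
# Rough training-compute estimate for the Granite 4.1 8B model using the
# common 6 * N * D approximation (N = parameters, D = training tokens).
# The parameter and token counts come from the release; the 6ND rule is
# a heuristic for dense decoder-only transformers, not an IBM-reported number.

def train_flops(params: float, tokens: float) -> float:
    """Approximate total training FLOPs as 6 * N * D."""
    return 6.0 * params * tokens

N = 8e9    # 8B dense parameters
D = 15e12  # ~15 trillion training tokens

print(f"~{train_flops(N, D):.1e} FLOPs")  # on the order of 7e23 FLOPs
```

By this estimate, the 8B run lands around 7×10²³ FLOPs; a dense model trained this long sits far past the Chinchilla-optimal token count, which is consistent with the release's emphasis on small-model quality.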
