Granite 4.1 8B

active
Context window: 131K tokens
Input / 1M tokens: $0.05
Output / 1M tokens: $0.10
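
The per-token prices listed above translate into a request cost by simple arithmetic. The following is a minimal sketch, assuming input and output tokens are billed linearly at the listed rates with no minimums, caching discounts, or rounding rules; the function name `estimate_cost` and the constants are hypothetical and not part of any provider API.

```python
from decimal import Decimal

# Listed prices in USD per 1M tokens (taken from the listing above).
INPUT_PRICE_PER_M = Decimal("0.05")
OUTPUT_PRICE_PER_M = Decimal("0.10")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request, assuming purely linear billing."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_PRICE_PER_M
        + Decimal(output_tokens) * OUTPUT_PRICE_PER_M
    )
    return total / TOKENS_PER_M


if __name__ == "__main__":
    # Hypothetical request: 100,000 input tokens and 20,000 output tokens.
    print(f"Estimated cost: ${estimate_cost(100_000, 20_000)}")
```

Under these assumptions, a request with 100,000 input tokens and 20,000 output tokens comes to $0.007. `Decimal` is used instead of floats so that small per-request amounts do not pick up binary rounding error when summed over many requests.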

Version History

4.1 (minor)

Granite 4.1 8B is a new release in IBM's Granite 4.1 family: an 8B-parameter dense, decoder-only model with enterprise-focused capabilities. It is released under the Apache 2.0 license, with a 131K-token context window and multilingual support.

Benchmark Scores

GPQA: 42.0%
HumanEval: 85.4%
MMLU: 73.8%
MMLU-Pro: 56.0%

Coverage

Model release · IBM

IBM's Granite 4.1: 8B Dense Model Matches 32B MoE Performance on 15T Tokens

IBM released Granite 4.1, a family of dense decoder-only LLMs (3B, 8B, and 30B parameters) trained on approximately 15 trillion tokens using a five-phase pre-training pipeline. The 8B instruct model matches or surpasses the previous Granite 4.0-H-Small (a 32B-A9B MoE) despite having fewer parameters and a simpler dense architecture. All models support context windows of up to 512K tokens and are released under the Apache 2.0 license.
