Gemma 4 31B
Google DeepMind (United States)
Status: active
Context window: 256K tokens

Version History

4.0 (major)

Gemma 4 introduces multimodal capabilities (text, image, and video, with audio supported on the small models), extended 256K context windows, configurable reasoning modes, and hybrid dense/mixture-of-experts architectures. It delivers substantial improvements over Gemma 3 in coding benchmarks, long-context reasoning, and on-device deployment efficiency.

Coverage

model release · Google DeepMind

Google DeepMind releases Gemma 4 family with 256K context window and multimodal capabilities

Google DeepMind released the Gemma 4 family of open-weights models in four sizes (2.3B to 31B parameters) with multimodal support for text, images, video, and audio. The flagship 31B model achieves 85.2% on MMLU Pro and 89.2% on AIME 2024, with context windows of up to 256K tokens. All models feature configurable reasoning modes, are optimized for deployment from mobile devices to servers, and are released under the Apache 2.0 license.

model release

Google releases Gemma 4 family under Apache 2.0 license with 2B to 31B models

Google has released Gemma 4, a family of four open models ranging from 2B to 31B parameters, available under the Apache 2.0 license for the first time. The 31B dense model ranks 3rd on the Arena AI Text Leaderboard and the 26B mixture-of-experts variant ranks 6th, both outperforming significantly larger competitors. All models support multimodal inputs and are available on Hugging Face, Kaggle, and Ollama.
