Gemma 4

Name: Gemma 4
Author: Google DeepMind

Google DeepMind🇺🇸 United States

active

Compare with other models →

Context window256K tokens

Version History

4.0majorApril 2, 2026

Gemma 4 introduces multimodal support, 256K context window, Apache 2.0 permissive licensing, and mixture of experts variant. First major version update with explicit focus on enterprise deployment without data usage restrictions.

1.0majorJanuary 8, 2025

Gemma 4 introduces multimodal support across text, image, video, and audio (small models), with context windows up to 256K, native reasoning modes, and function calling. Four model sizes (E2B 2.3B to 31B dense) target deployment from mobile to enterprise, with significant reasoning and coding benchmark improvements over Gemma 3.

Coverage

model releaseGoogle DeepMind

Google DeepMind releases Gemma 4 with four model sizes, up to 256K context, multimodal support

Google DeepMind released Gemma 4, an open-weights multimodal model family in four sizes (2.3B to 31B parameters) with context windows up to 256K tokens. All models support text and image input, with audio native to E2B and E4B variants. The Gemma 4 31B dense model scores 85.2% on MMLU Pro, 89.2% on AIME 2026, and 80.0% on LiveCodeBench—significant improvements over Gemma 3.

April 8, 2026 · 5:50 AM2 min read

google-deepmind gemma-4 multimodal

model release

Google launches Gemma 4 open-weights models with Apache 2.0 license to compete with Chinese LLMs

Google released Gemma 4, a new line of open-weights models available in sizes from 2 billion to 31 billion parameters, under a permissive Apache 2.0 license. The release includes multimodal capabilities, support for 140+ languages, native function calling, and a 256,000-token context window for the larger variants.

April 2, 2026 · 9:35 PM3 min read

gemma-4 google-deepmind open-weights