DiffusionGemma 26B A4B IT

Google DeepMind, 🇺🇸 United States
active

Version History

26B-A4B-it (major)

Initial release of DiffusionGemma, a discrete-diffusion text generation model built on the Gemma 4 26B A4B MoE architecture, with an encoder-decoder design for parallel token generation.

26B-A4B-IT (major)

Google releases DiffusionGemma 26B as an open-weight model under the Apache 2.0 license, bringing diffusion-based text generation to production with inference speeds above 500 tokens per second.

Coverage

model release (Google DeepMind)

Google DeepMind releases DiffusionGemma, a 26B parameter model generating 15-20 tokens per forward pass via discrete dif

Google DeepMind released DiffusionGemma, a 26B-parameter mixture-of-experts model that generates text using discrete diffusion instead of autoregression. The model processes blocks of 256 tokens in parallel, reaching generation speeds above 1,100 tokens per second on H100 GPUs in low-batch settings.
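
The figures quoted in this coverage (256-token blocks, 15-20 tokens per forward pass) can be combined into a rough estimate of how many forward passes one block might take. The sketch below is back-of-the-envelope arithmetic only: it assumes the per-pass rate is constant across a block, which the coverage does not state, and the function names are hypothetical rather than part of any DiffusionGemma API.

```python
import math

BLOCK_SIZE = 256            # tokens per parallel block, as quoted in the coverage
TOKENS_PER_PASS = (15, 20)  # quoted range of tokens produced per forward pass


def passes_per_block(block_size: int, tokens_per_pass: int) -> int:
    """Forward passes needed to fill one block, assuming a constant rate (an assumption)."""
    return math.ceil(block_size / tokens_per_pass)


def passes_range(block_size: int = BLOCK_SIZE, rates: tuple = TOKENS_PER_PASS) -> tuple:
    """Return (fewest, most) passes per block for the quoted rate range."""
    low_rate, high_rate = rates
    return passes_per_block(block_size, high_rate), passes_per_block(block_size, low_rate)


if __name__ == "__main__":
    fewest, most = passes_range()
    # One-token-per-pass decoding would need block_size passes for the same block.
    baseline = passes_per_block(BLOCK_SIZE, 1)
    print(f"Estimated passes per {BLOCK_SIZE}-token block: {fewest} to {most}")
    print(f"One-token-per-pass baseline: {baseline}")
```

Under these assumptions a block takes roughly 13 to 18 passes, compared with 256 passes when one token is produced per pass. Actual throughput also depends on per-pass cost, batch size, and hardware, none of which this estimate models.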
