Gemma 4
1 article tagged with Gemma 4
June 9, 2026
changelogGoogle DeepMind
Google DeepMind Releases Quantization-Aware Training Versions of Gemma 4 Models in GGUF Format
Google DeepMind has released quantization-aware training (QAT) optimized versions of its Gemma 4 model family in GGUF Q4_0 format. The QAT versions preserve similar quality to bfloat16 while dramatically reducing memory requirements, with models available across the entire Gemma 4 lineup: E2B, E4B, 12B, 26B A4B, and 31B.