Gemma 4 12B Unified

Google DeepMind🇺🇸 United States
active
Context window256K tokens

Version History

4major

Gemma 4 12B Unified introduces encoder-free multimodal architecture that processes text, images, and audio through a single decoder-only transformer, eliminating separate vision and audio encoders while maintaining 256K context window and strong benchmark performance.

Coverage