Google DeepMind

Google's AI research lab, maker of Gemini

News

Google DeepMind connects Genie world model to 280 billion Street View images, Waymo already using for self-driving train

Google DeepMind has integrated its Genie world model with Street View's 280 billion images spanning 110 countries, enabling users to explore AI-generated simulations of real locations. Waymo is already using Genie 3 to train self-driving cars on rare scenarios like tornadoes and unexpected obstacles.

May 19, 2026 · 8:05 PM2 min read

google-deepmind world-models street-view

model releaseGoogle DeepMind

Google DeepMind Releases Gemma 4 E4B with Multi-Token Prediction for 2x Faster Inference

Google DeepMind released the Gemma 4 E4B assistant model using Multi-Token Prediction (MTP) architecture that accelerates inference by up to 2x through speculative decoding. The 4.5B effective parameter model supports 128K context windows and handles text, image, and audio input with pricing not yet disclosed.

May 10, 2026 · 5:06 AM3 min read

Gemma Google DeepMind multimodal

model releaseGoogle DeepMind

Google DeepMind Releases Gemma 4 26B A4B Assistant Model for 2x Faster Inference via Multi-Token Prediction

Google DeepMind has released a Multi-Token Prediction assistant model for Gemma 4 26B A4B that achieves up to 2x decoding speedup through speculative decoding. The model uses 3.8B active parameters from a 25.2B total parameter MoE architecture with 128 experts and a 256K token context window.

May 6, 2026 · 3:06 PM2 min read

gemma-4 google-deepmind mixture-of-experts

model releaseGoogle DeepMind

Google DeepMind releases Gemma 4 with 31B dense model, 256K context window, and speculative decoding drafters

Google DeepMind has released Gemma 4, a family of open-weight multimodal models including a 31B dense model with 256K context window and four size variants ranging from 2.3B to 30.7B effective parameters. The release includes Multi-Token Prediction (MTP) draft models that achieve up to 2x decoding speedup through speculative decoding while maintaining identical output quality.

May 6, 2026 · 1:51 AM3 min read

google-deepmind gemma-4 multimodal

model releaseGoogle DeepMind

Google DeepMind releases Gemini 3.1 Flash TTS with audio tags for precise speech control across 70+ languages

Google DeepMind launched Gemini 3.1 Flash TTS, a text-to-speech model that achieved an Elo score of 1,211 on the Artificial Analysis TTS leaderboard. The model introduces audio tags that allow developers to control vocal style, pace, and delivery through natural language commands embedded in text input, with support for 70+ languages.

April 15, 2026 · 4:21 PM2 min read

text-to-speech google-deepmind gemini

model releaseGoogle DeepMind

Google DeepMind releases Gemma 4 with four model sizes, up to 256K context, multimodal support

Google DeepMind released Gemma 4, an open-weights multimodal model family in four sizes (2.3B to 31B parameters) with context windows up to 256K tokens. All models support text and image input, with audio native to E2B and E4B variants. The Gemma 4 31B dense model scores 85.2% on MMLU Pro, 89.2% on AIME 2026, and 80.0% on LiveCodeBench—significant improvements over Gemma 3.

April 8, 2026 · 5:50 AM2 min read

google-deepmind gemma-4 multimodal

model releaseGoogle DeepMind

Google DeepMind releases Gemma 4 family: multimodal models from 2.3B to 31B parameters with 256K context

Google DeepMind released the Gemma 4 family of open-weights multimodal models in four sizes: E2B (2.3B effective parameters), E4B (4.5B effective), 26B A4B (3.8B active parameters), and 31B dense. All models support text and image input with 128K-256K context windows; E2B and E4B add native audio capabilities. Models feature reasoning modes, function calling, and multilingual support across 140+ languages.

April 6, 2026 · 9:05 AM3 min read

gemma google-deepmind multimodal

model releaseGoogle DeepMind

NVIDIA releases Gemma 4 31B quantized model with 256K context, multimodal capabilities

NVIDIA has released a quantized version of Google DeepMind's Gemma 4 31B IT model, compressed to NVFP4 format for efficient inference on consumer GPUs. The 30.7B-parameter multimodal model supports 256K token context windows, handles text and image inputs with video frame processing, and maintains near-baseline performance across reasoning and coding benchmarks.

April 4, 2026 · 5:50 AM2 min read

gemma nvidia quantization

model releaseGoogle DeepMind

Google DeepMind releases Gemma 4 with multimodal reasoning and up to 256K context window

Google DeepMind released Gemma 4, a multimodal model family supporting text, images, video, and audio with context windows up to 256K tokens. The release includes four sizes (E2B, E4B, 26B A4B, and 31B) designed for deployment from mobile devices to servers. The 31B dense model achieves 85.2% on MMLU Pro and 89.2% on AIME 2026.

April 4, 2026 · 12:50 AM3 min read

gemma google-deepmind multimodal

model releaseGoogle DeepMind

Google DeepMind releases Gemma 4 with four models up to 31B parameters, 256K context window

Google DeepMind released Gemma 4, an open-weights multimodal model family in four sizes (E2B, E4B, 26B A4B, 31B) with context windows up to 256K tokens and native reasoning capabilities. The 26B A4B variant uses Mixture-of-Experts architecture with 3.8B active parameters for efficient inference. All models support text, image input and handle 140+ languages with Apache 2.0 licensing.

April 3, 2026 · 6:35 AM2 min read

google-deepmind gemma open-weights

model releaseGoogle DeepMind

Google DeepMind releases Gemma 4, open multimodal models with 256K context and reasoning

Google DeepMind has released Gemma 4, a family of open-weights multimodal models ranging from 2.3B to 31B parameters with support for text, images, video, and audio. The models feature context windows up to 256K tokens, built-in reasoning modes, and native function calling for agentic workflows.

April 3, 2026 · 4:05 AM3 min read

gemma-4 google-deepmind open-source

model releaseGoogle DeepMind

Google DeepMind releases Gemma 4 open models with up to 256K context and multimodal reasoning

Google DeepMind has released Gemma 4, an open-weights model family in four sizes (2.3B to 31B parameters) with multimodal capabilities handling text, images, video, and audio. The 26B A4B variant uses mixture-of-experts to achieve 4B active parameters while supporting 256K token context windows and native reasoning modes.

April 3, 2026 · 12:05 AM3 min read

gemma google-deepmind open-models

model releaseGoogle DeepMind

Google DeepMind releases Gemma 4 family with 256K context window and multimodal capabilities

Google DeepMind released the Gemma 4 family of open-weights models in four sizes (2.3B to 31B parameters) with multimodal support for text, images, video, and audio. The flagship 31B model achieves 85.2% on MMLU Pro and 89.2% on AIME 2024, with context windows up to 256K tokens. All models feature configurable reasoning modes and are optimized for deployment from mobile devices to servers under Apache 2.0 license.

April 2, 2026 · 9:35 PM3 min read

gemma-4 google-deepmind open-source-ai

model releaseGoogle DeepMind

Google DeepMind releases Gemma 4 with 4 model sizes, 256K context, and multimodal reasoning

Google DeepMind released Gemma 4, a family of open-weights multimodal models in four sizes: E2B (2.3B effective), E4B (4.5B effective), 26B A4B (3.8B active), and 31B (30.7B parameters). All models support text and image input with 128K-256K context windows, while E2B and E4B add native audio capabilities and reasoning modes across 140+ languages.

April 2, 2026 · 8:50 PM3 min read

gemma-4 google-deepmind open-weights

model releaseGoogle DeepMind

Google DeepMind releases Gemma 4 open models with multimodal capabilities and 256K context window

Google DeepMind released the Gemma 4 family of open-source models with multimodal capabilities (text, image, audio, video) and context windows up to 256K tokens. Four distinct model sizes—E2B (2.3B effective parameters), E4B (4.5B effective), 26B A4B (3.8B active), and 31B—are available under the Apache 2.0 license, with instruction-tuned and pre-trained variants.

April 2, 2026 · 7:05 PM3 min read

google-deepmind open-source multimodal

model releaseGoogle DeepMind

Google DeepMind releases Gemma 4: multimodal models up to 31B parameters with 256K context

Google DeepMind released the Gemma 4 family of open-weights multimodal models in four sizes: E2B (2.3B effective), E4B (4.5B effective), 26B A4B (25.2B total, 3.8B active), and 31B dense. All models support text and image input with 128K-256K context windows, reasoning modes, and native function calling for agentic workflows.

April 2, 2026 · 6:20 PM2 min read

gemma google-deepmind multimodal

model releaseGoogle DeepMind

Google DeepMind releases Gemma 4: open models ranking #3 and #6 on Arena AI leaderboard

Google DeepMind released Gemma 4, a family of four open models ranging from 2B to 31B parameters, all licensed under Apache 2.0. The 31B dense model ranks #3 on Arena AI's text leaderboard and the 26B mixture-of-experts variant ranks #6, outperforming closed models significantly larger in size.

April 2, 2026 · 4:20 PM2 min read

gemma-4 google-deepmind open-source-models

researchGoogle DeepMind

Google Deepmind identifies six attack categories that can hijack autonomous AI agents

A Google Deepmind paper introduces the first systematic framework for 'AI agent traps'—attacks that exploit autonomous agents' vulnerabilities to external tools and internet access. The researchers identify six attack categories targeting perception, reasoning, memory, actions, multi-agent networks, and human supervisors, with proof-of-concept demonstrations for each.

April 1, 2026 · 5:20 PM3 min read

ai-security autonomous-agents prompt-injection

product updateGoogle DeepMind

Google DeepMind launches Lyria 3 Pro with 3-minute track generation and structural awareness

Google DeepMind introduced Lyria 3 Pro, an advanced music generation model capable of creating tracks up to 3 minutes long with structural awareness of musical composition elements like intros, verses, choruses, and bridges. The model is rolling out across multiple Google products including Vertex AI, Google Vids, Gemini app, and the new ProducerAI collaborative tool.

March 25, 2026 · 4:20 PM2 min read

google-deepmind lyria music-generation

changelogGoogle DeepMind

Google DeepMind's Gemini 3.1 Flash-Lite generates websites in real time, 2.5x faster than predecessor

Google DeepMind released Gemini 3.1 Flash-Lite, a model that generates functional websites in real time through a new pseudo-browser demo. The model achieves first response token 2.5 times faster than Gemini 2.5 Flash and outputs over 360 tokens per second, though output pricing has tripled from $0.40 to $1.50 per million tokens.

March 24, 2026 · 7:20 PM1 min read

gemini google-deepmind model-release

Models

Gemini 3.5 Flash

Google DeepMind

Google DeepMind

News

Google DeepMind connects Genie world model to 280 billion Street View images, Waymo already using for self-driving train

Google DeepMind Releases Gemma 4 E4B with Multi-Token Prediction for 2x Faster Inference

Google DeepMind Releases Gemma 4 26B A4B Assistant Model for 2x Faster Inference via Multi-Token Prediction

Google DeepMind releases Gemma 4 with 31B dense model, 256K context window, and speculative decoding drafters

Google DeepMind releases Gemini 3.1 Flash TTS with audio tags for precise speech control across 70+ languages

Google DeepMind releases Gemma 4 with four model sizes, up to 256K context, multimodal support

Google DeepMind releases Gemma 4 family: multimodal models from 2.3B to 31B parameters with 256K context

NVIDIA releases Gemma 4 31B quantized model with 256K context, multimodal capabilities

Google DeepMind releases Gemma 4 with multimodal reasoning and up to 256K context window

Google DeepMind releases Gemma 4 with four models up to 31B parameters, 256K context window

Google DeepMind releases Gemma 4, open multimodal models with 256K context and reasoning

Google DeepMind releases Gemma 4 open models with up to 256K context and multimodal reasoning

Google DeepMind releases Gemma 4 family with 256K context window and multimodal capabilities

Google DeepMind releases Gemma 4 with 4 model sizes, 256K context, and multimodal reasoning

Google DeepMind releases Gemma 4 open models with multimodal capabilities and 256K context window

Google DeepMind releases Gemma 4: multimodal models up to 31B parameters with 256K context

Google DeepMind releases Gemma 4: open models ranking #3 and #6 on Arena AI leaderboard

Google Deepmind identifies six attack categories that can hijack autonomous AI agents

Google DeepMind launches Lyria 3 Pro with 3-minute track generation and structural awareness

Google DeepMind's Gemini 3.1 Flash-Lite generates websites in real time, 2.5x faster than predecessor

Models

Gemini 3.5 Flash

Gemini 3.5 Flash

Gemini Omni Flash

Gemma 4 E4B Assistant

Gemini 3.1 Flash Lite

Gemma 4 31B IT Assistant (MTP Drafter)

Gemma 4 26B A4B Assistant

Google Gemini Flash Latest

Google Gemini Pro Latest

Gemma 4 E2B VLA

Gemini 3.1 Flash TTS

Gemini 3.1 Flash TTS

Gemma 4 E4B Instruction-Tuned

Gemma 4 26B A4B

Gemma 4 31B Dense

Gemma 4 31B Instruct

Gemma 4 E2B Instruction-Tuned

Gemma 4 31B

Gemma 4 31B Instruct

Veo 3.1 Lite

Google Lyria 3 Pro Preview

Google Lyria 3 Clip Preview

Gemini 3.1 Flash Live

Lyria 3 Pro

Gemini (Ask Maps)

Gemini 3.1 Flash Lite Preview

Nano Banana 2 (Gemini 3.1 Flash Image Preview)

Lyria 3

Gemini 3.1 Pro

Gemini 3.0 Pro

Gemini 2.5 Flash

Gemma 4 E2B

Gemma 4 26B A4B IT

Gemini 2.5 Pro

Gemini 2.0 Flash

Gemini 2.0 Flash-Lite

Gemini 2.0 Flash

Lyria 3 Pro

Gemma 4

Gemini 1.5 Flash

Gemini 1.5 Pro

Top Benchmark Scores