Gemma

4 articles tagged with Gemma

May 10, 2026
model release · Google DeepMind

Google DeepMind Releases Gemma 4 E4B with Multi-Token Prediction for 2x Faster Inference

Google DeepMind released the Gemma 4 E4B assistant model, which uses a Multi-Token Prediction (MTP) architecture to accelerate inference by up to 2x through speculative decoding. The model has 4.5B effective parameters, supports a 128K context window, and handles text, image, and audio input. Pricing has not yet been disclosed.

April 22, 2026
model release

Gemma 4 VLA runs locally on NVIDIA Jetson Orin Nano Super with 8GB RAM, autonomous webcam tool-calling

NVIDIA engineer Asier Arranz demonstrated Gemma 4 running as a vision-language agent (VLA) on a Jetson Orin Nano Super with 8GB RAM. The model decides on its own when to access the webcam based on the user's query, with no hardcoded triggers, and the speech-to-text, vision analysis, and text-to-speech all run entirely locally.

April 21, 2026
product update · NVIDIA

NVIDIA Releases 7 Million Synthetic Korean Personas Dataset for AI Agent Localization

NVIDIA released Nemotron-Personas-Korea, a dataset of 7 million demographically accurate synthetic personas grounded in official Korean statistics from KOSIS, the Supreme Court of Korea, and the National Health Insurance Service. Each persona has 26 fields covering demographics, geography, and occupation across all 17 Korean provinces. The dataset contains no personally identifiable information and is released under a CC BY 4.0 license.

April 13, 2026
analysis

Google Gemma 4 Runs Locally on Edge Devices, Creating Enterprise Security Blind Spot

Google released Gemma 4, an open-weights model family that runs directly on edge devices with multi-step planning and autonomous workflow capabilities. Because the Apache 2.0-licensed models execute entirely on local hardware, they bypass traditional cloud security controls, creating a governance blind spot for enterprise security teams.