Google DeepMind connects Genie world model to 280 billion Street View images; Waymo already using it for self-driving training

TL;DR

Google DeepMind has integrated its Genie world model with Street View's 280 billion images spanning 110 countries, enabling users to explore AI-generated simulations of real locations. Waymo is already using Genie 3 to train self-driving cars on rare scenarios like tornadoes and unexpected obstacles.

Google DeepMind announced at Google I/O that its Genie world model can now access Street View's 280 billion images captured across 110 countries and seven continents. The integration allows users to navigate AI-generated simulations of real locations, from snow-covered New York City blocks to London streets.

Genie 3 timeline and access

Genie 3 first appeared as a research preview in August 2025. DeepMind opened access to Google AI Ultra subscribers in the United States in January 2026. The Street View integration is now rolling out to some Ultra users in the US, with global expansion planned in the coming weeks.

Waymo deployment for autonomous vehicle training

Waymo is already using Genie 3 in production to train self-driving cars on rare scenarios that would be dangerous or impractical to stage in real life, including tornadoes and unexpected encounters with elephants on roads. The Street View integration adds geographic realism to these training simulations.

Current limitations

According to Diego Rivas, product manager at DeepMind, the generated environments look closer to video games than photographs. The model is not yet physics-aware: in demonstrations, characters ran through cacti without consequences. Research scientist Jack Parker-Holder estimates that interactive world generation trails video generation by six to 12 months in accuracy.

For comparison, Google's Veo model already understands basic physics, and the company's Nano Banana tool can render accurate text in infographics. Genie has not reached that level.

Spatial continuity advantage

Jonathan Herbert, director of Google Maps, highlighted that Genie maintains spatial continuity. Users can turn 360 degrees inside a generated environment, and the AI remembers what was behind them rather than regenerating the scene from scratch with each viewpoint shift.
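The difference between remembering a scene and regenerating it can be illustrated with a toy sketch. This is not how Genie works internally; the class names `StatelessViewer` and `PersistentViewer` and the integer "scene IDs" are hypothetical stand-ins, used only to show why cached state makes a full turn return to the same view.

```python
from itertools import count


class StatelessViewer:
    """Hypothetical generator that builds a fresh scene on every look."""

    def __init__(self):
        self._next_id = count()

    def look(self, heading_deg):
        # Nothing is remembered, so every call yields new content.
        return next(self._next_id)


class PersistentViewer:
    """Hypothetical generator that remembers what it produced per heading."""

    def __init__(self):
        self._next_id = count()
        self._memory = {}

    def look(self, heading_deg):
        key = heading_deg % 360  # 360 degrees is the same direction as 0
        if key not in self._memory:
            self._memory[key] = next(self._next_id)
        return self._memory[key]


def full_turn(viewer, step=90):
    """Look ahead, turn through 360 degrees in steps, then look ahead again."""
    first = viewer.look(0)
    for heading in range(step, 360, step):
        viewer.look(heading)
    return first, viewer.look(360)


print(full_turn(PersistentViewer()))  # same scene before and after the turn
print(full_turn(StatelessViewer()))   # a different scene after the turn
```

In the persistent case the view at the end of the turn matches the view at the start, which is the behavior Herbert describes; in the stateless case it does not.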

Two use cases

Parker-Holder identified two distinct audiences: robotics developers training agents in simulated environments that mirror actual locations, and ordinary users exploring for entertainment. The simulation-to-reality pipeline is a critical bottleneck in physical AI, with companies including Nvidia and Cadence working on similar problems.

What this means

Street View's dataset, 20 years of imagery across 110 countries, represents a competitive moat that no other AI lab can easily replicate. By connecting this data to a generative world model, Google transforms a passive mapping tool into an interactive training ground for robotics and autonomous vehicles. The Waymo deployment demonstrates immediate practical value beyond consumer exploration.

The six-to-12-month lag behind video generation quality suggests rapid improvement is possible, but the current lack of physics awareness limits immediate applications. The spatial continuity feature indicates DeepMind is solving fundamental challenges in maintaining coherent 3D representations rather than simply generating impressive individual frames.
