model release

AI2 releases robotics models trained entirely in simulation, achieving zero-shot real-world transfer

TL;DR

AI2 has released MolmoSpaces and MolmoBot, robotics models trained exclusively in simulation that transfer directly to real robots without manual real-world data collection or fine-tuning. The approach eliminates months of teleoperated demonstrations typically required for simulation-trained robots. Both systems are open-source.

2 min read
0

AI2 Releases Robotics Models Trained Entirely in Simulation, Achieving Zero-Shot Real-World Transfer

AI research institute AI2 has released two open-source robotics models—MolmoSpaces and MolmoBot—trained exclusively in simulation that achieve zero-shot transfer to real robots without any manually collected real-world data or fine-tuning.

What's New

The models represent a significant shift in robotics training methodology. Conventional approaches require researchers to spend months collecting teleoperated real-world demonstrations before simulation-trained robots become reliable. AI2's approach eliminates this bottleneck entirely.

MolmoSpaces, the foundation dataset, contains:

  • 230,000+ indoor scenes
  • 130,000+ curated objects
  • 42 million physics-based robotic grasping annotations

MolmoBot, built on MolmoSpaces, demonstrates capabilities including:

  • Object picking and placement
  • Opening drawers
  • Operating doors

All tasks execute without training data from real-world demonstrations.

The Technical Approach

According to Ranjay Krishna, director of the PRIOR team at AI2, the key insight is straightforward: the simulation-to-reality gap shrinks dramatically when researchers increase the variety of simulated environments, objects, and camera conditions. Rather than improving physics simulation fidelity, the models benefit from diversity in training conditions.

This aligns with recent findings in robotics research showing that breadth of training distribution often matters more than pixel-perfect simulation accuracy. By exposing models to hundreds of thousands of variations in scene configuration, object types, and viewpoints, the models learn generalizable behaviors that transfer directly to physical systems.

Open-Source Release

Both models and supporting tools are available publicly. Technical details are available in the accompanying research paper. The open-source approach allows other research groups and robotics companies to build on the foundation rather than starting from zero with their own simulation data collection.

What This Means

This work addresses one of robotics' most significant friction points: the cost and time required to train deployable systems. If the zero-shot transfer results hold up in broader testing, the implications are substantial. Companies and research labs could dramatically reduce development timelines—from months of manual demonstration collection to weeks of model training. This could accelerate deployment of manipulation tasks in warehousing, manufacturing, and service robotics.

The emphasis on simulation diversity over physics accuracy also reframes how the robotics community should approach simulation tools. Rather than competing on fidelity, platforms that generate high-variance synthetic training data may prove more valuable. This could shift investment and resource allocation within the robotics software ecosystem.

Related Articles

model release

Baidu Releases Unlimited-OCR, a 3B Parameter Document Parsing Model Based on Deepseek-OCR

Baidu has released Unlimited-OCR, a 3 billion parameter model for optical character recognition and document parsing. The model supports single-page and multi-page document processing with a 32,768 token context window and runs on NVIDIA GPUs using bfloat16 precision.

model release

Poolside releases Laguna M.1: 225B parameter MoE model scores 74.6% on SWE-bench Verified

Poolside has released Laguna M.1, a 225B total parameter Mixture-of-Experts model with 23B activated parameters per token, designed for agentic coding tasks. The model scores 74.6% on SWE-bench Verified and 63.1% on SWE-bench Multilingual, released under Apache 2.0 license.

model release

Mistral releases Leanstral, open-source 6B-parameter proof assistant for Lean 4 under Apache 2.0

Mistral AI has released Leanstral, a sparse 120B model with 6B active parameters designed specifically for the Lean 4 proof assistant. The model is available under Apache 2.0 license with free API access and achieves a 26.3 FLTEval score at pass@2, outperforming Claude Sonnet 4.6 while costing $36 versus $549.

model release

Mistral Releases Mistral 3 Family: 675B-Parameter Large 3 MoE and Three Edge Models Under Apache 2.0

Mistral has released Mistral 3, including Mistral Large 3—a sparse mixture-of-experts model with 41B active and 675B total parameters—and three Ministral 3 edge models (3B, 8B, 14B). All models are released under Apache 2.0 license with multimodal capabilities and are available today on multiple platforms.

Comments

Loading...