model release

Allen Institute Releases OlmoEarth v1.1 with 3x Compute Reduction for Satellite Imagery

TL;DR

Allen Institute (AI2) released OlmoEarth v1.1, a family of transformer-based models for satellite imagery processing that reduces compute costs by up to 3x compared to the original OlmoEarth v1. The efficiency gains come from collapsing Sentinel-2 resolution bands into single tokens, cutting sequence lengths by three times while maintaining benchmark performance.

May 19, 2026 · 6:51 PM2 min read

OlmoEarth v1.1 — Quick Specs

Compare OlmoEarth v1.1 with other models →

OlmoEarth v1.1: 3x Compute Reduction for Satellite Imagery Models

Allen Institute for AI (AI2) released OlmoEarth v1.1 on May 19, 2026, a family of transformer-based models that cuts compute costs by up to 3x compared to OlmoEarth v1 while maintaining performance on remote sensing benchmarks.

Technical Implementation

The efficiency gains stem from a fundamental redesign of how the model tokenizes Sentinel-2 satellite imagery. OlmoEarth v1 created separate tokens for each of Sentinel-2's three resolutions (10m, 20m, and 60m), generating 6 tokens per spatial patch per 2-timestep input. OlmoEarth v1.1 collapses these into single tokens, reducing token counts by three times.

This approach wasn't straightforward. AI2 reports that naive token merging caused a 10 percentage point drop on m-eurosat kNN, a standard remote sensing benchmark. The team modified their pretraining regimen to maintain cross-band relationship modeling, detailed in their technical report.

Model Family

AI2 released three model sizes:

Base
Tiny
Nano

All variants process Sentinel-2 data with tensors formatted as [H, W, T, D=12], where H and W represent latitudinal and longitudinal pixels, T is the temporal dimension, and D covers 12 Sentinel-2 channels.

Performance Trade-offs

AI2 states OlmoEarth v1.1 maintains similar performance to v1 on their benchmark mix and partner-constructed tasks, though the technical report notes "some regressions." Because both versions train on identical datasets, performance differences isolate methodological changes.

Compute costs in transformer models scale quadratically with token sequence length, making the 3x reduction in tokens significant for inference and fine-tuning. AI2 measured efficiency using MACs (multiply-accumulate operations per forward pass).

Deployment Context

Since OlmoEarth v1's November 2025 release, partners have deployed it for mangrove tracking, forest loss classification, and country-scale crop mapping. AI2 reports deployments now scale to national, continental, and global areas, with data export, preprocessing, inference, and post-processing dominated by compute costs.

What This Means

OlmoEarth v1.1 addresses the practical bottleneck in satellite imagery AI: compute cost at scale. A 3x reduction enables more frequent planet-scale map refreshes and lowers barriers for organizations without large compute budgets. For researchers, training on identical datasets to v1 creates a controlled comparison for studying pretraining methodologies in remote sensing.

The model family is available on Hugging Face with full training code. AI2 recommends existing OlmoEarth v1 users test v1.1 for their specific tasks given the documented performance regressions.

Source: huggingface.co ↗

OlmoEarth satellite imagery remote sensing transformer models Sentinel-2 Allen Institute AI2 compute efficiency

model releaseJuly 4, 2026

Mistral releases Leanstral 1.5: 119B parameter open-source model for Lean 4 proof assistance

Mistral AI has released Leanstral 1.5, an open-source 119B parameter mixture-of-experts model designed specifically for Lean 4 proof assistance. The model features 128 experts with 4 active per token (6.5B activated parameters), a 256k token context window, and multimodal input capabilities.

model releaseJuly 4, 2026

NVIDIA releases Nemotron-Labs-TwoTower-30B: block-wise diffusion model claims 2.42× faster generation at 98.7% baseline

NVIDIA released Nemotron-Labs-TwoTower-30B-A3B-Base-BF16, a block-wise diffusion language model that generates text by denoising blocks of tokens in parallel rather than sequentially. According to NVIDIA, the model achieves 2.42× the wall-clock generation throughput of its autoregressive baseline while retaining 98.7% of aggregate benchmark quality.

model releaseJuly 3, 2026

Mistral Releases Leanstral 1.5: 6B-Parameter Model Achieves 100% on miniF2F, Solves 587/672 PutnamBench Problems

Mistral AI released Leanstral 1.5, a free Apache-2.0 licensed model with 119B total parameters and 6B active parameters specialized for formal verification in Lean 4. The model achieves 100% on miniF2F benchmark, solves 587 of 672 PutnamBench problems at $4 per problem (versus $300+ for competitors), and reaches state-of-the-art 87% on FATE-H and 34% on FATE-X benchmarks.

model releaseJuly 1, 2026

Anthropic Restores Claude Fable 5 After Government Takedown, With Stricter Cybersecurity Blocks

Anthropic is redeploying Claude Fable 5 after a month-long government-mandated takedown triggered by Amazon researchers discovering a method to bypass the model's cybersecurity safeguards. The returning version includes enhanced safety classifiers that automatically block cybersecurity tasks and revert to Opus 4.8, with restricted availability through usage credits only.