Google DeepMind releases Nano Banana 2 Lite at $0.034 per 1K image with 4-second generation, opens Gemini Omni Flash API

TL;DR

Google DeepMind released Nano Banana 2 Lite (gemini-3.1-flash-lite-image), its fastest image generation model with 4-second text-to-image latency priced at $0.034 per 1K-resolution image. The company also opened developer access to Gemini Omni Flash (gemini-omni-flash-preview) for video generation and editing at $0.10 per second of output.

June 30, 2026 · 4:21 PM3 min read

Nano Banana 2 Lite (Gemini 3.1 Flash Lite Image) — Quick Specs

Output$34/1M tokens

Compare Nano Banana 2 Lite (Gemini 3.1 Flash Lite Image) with other models →

Google DeepMind releases Nano Banana 2 Lite at $0.034 per 1K image with 4-second generation, opens Gemini Omni Flash API at $0.10 per video second

Nano Banana 2 Lite targets high-throughput developer workflows where speed and cost matter more than maximum quality. According to Google, it's positioned as the recommended replacement for the original Nano Banana (gemini-2.5-flash-image).

Nano Banana 2 Lite specifications

Latency: 4 seconds for text-to-image generation
Pricing: $0.034 per 1K-resolution image
API name: gemini-3.1-flash-lite-image
Availability: Google AI Studio, Gemini API, Gemini Enterprise Agent Platform

Google claims the model maintains "reliable prompt adherence, strong character consistency and legible in-image text rendering" despite optimizations for speed. The company published internal benchmarks comparing Elo quality scores, latency, and cost against competitor models, though independent verification is not yet available.

The Nano Banana family now includes four models:

Nano Banana 2 Lite (Gemini 3.1 Flash Lite Image): Speed-optimized
Nano Banana 2 (Gemini 3.1 Flash Image): Balanced performance
Nano Banana Pro (Gemini 3 Pro Image): Quality-optimized for professional use
Nano Banana (Gemini 2.5 Flash Image): Legacy model, upgrade recommended

Gemini Omni Flash video capabilities

Gemini Omni Flash, first introduced at Google I/O, is now accessible via API in public preview. The model handles video generation and editing from text, image, and video inputs with conversational refinement.

Key specifications:

Pricing: $0.10 per second of video output (matching Veo 3.1 Fast)
API name: gemini-omni-flash-preview
Current output length: 10 seconds (longer durations coming)
Availability: Google AI Studio, Gemini API

Current limitations:

Video references up to 3 seconds accepted but not correctly processed
Audio reference uploads not yet supported
Character consistency issues across scene changes
Scene extension not available

The model uses Gemini's multimodal reasoning to synchronize text, graphics, and actions in generated video. Google positions it for conversational editing workflows where users iteratively refine outputs through natural language.

Combined workflow capabilities

Google published three demo applications showing Nano Banana 2 Lite generating initial images that Gemini Omni Flash then animates:

Anywhere: Generates landmark backgrounds, animates into video clips
Space Lift: Interior design visualization with cinematic walkthroughs
Omni Product Studio: E-commerce product video generation

The Interactions API supports up to three sequential edits with maintained session context. Both models include SynthID watermarking for content provenance.

What this means

Google is aggressively pricing Nano Banana 2 Lite to compete in the high-volume image generation market, undercutting alternatives where speed matters more than quality. The $0.034 per 1K image pricing and 4-second latency target prototyping and draft-heavy workflows. However, without independent benchmarks, developers need to validate quality claims against their specific use cases.

The Gemini Omni Flash API opening is more significant. At $0.10 per second, it's the first major video generation model with conversational editing officially available via API, though the 10-second limit and processing bugs (especially around video references) suggest it's genuinely preview-stage. The combined image-to-video pipeline could enable new product categories in e-commerce, marketing, and content creation—if the quality and reliability hold up at scale.

Source: deepmind.google ↗

google-deepmind nano-banana gemini-omni image-generation video-generation api-release pricing

model releaseJune 30, 2026

Anthropic releases Claude Sonnet 5 at $2/1M input tokens, 63.2% agentic coding benchmark

Anthropic has released Claude Sonnet 5, its new mid-tier model optimized for agentic tasks, priced at $2 per million input tokens through August 31 before rising to $3/1M. The model scores 63.2% on agentic coding benchmarks, approaching Opus 4.8's 69.2% performance at a significantly lower price point.

model releaseJune 30, 2026

Anthropic releases Claude Sonnet 5 with improved agentic capabilities, $2/$10 per million tokens through August

Anthropic has released Claude Sonnet 5, replacing Sonnet 4.6 as its medium-sized model. The company claims improved agentic performance approaching Opus 4.8 levels while maintaining lower pricing at $2 per million input tokens and $10 per million output tokens through August 31.

model releaseJune 27, 2026

OpenAI previews GPT-5.6 to select partners with three variants priced from $1 to $30 per million tokens

OpenAI has begun previewing its GPT-5.6 series to a limited group of trusted partners after government review. The release includes three variants: Sol at $5 input/$30 output per million tokens, Terra at $2.50/$15, and Luna at $1/$6.

model releaseJune 26, 2026

OpenAI releases GPT-5.6 in three tiers with limited government-coordinated rollout

OpenAI announced GPT-5.6, a three-tier model series launching through a limited preview coordinated with the U.S. government. The models—Sol, Terra, and Luna—are priced from $1/$6 to $5/$30 per million input/output tokens and introduce new max and ultra reasoning modes.