model release | Google DeepMind

Google DeepMind releases Nano Banana 2 Lite at $0.034 per 1K-resolution image with 4-second generation, opens Gemini Omni Flash API

TL;DR

Google DeepMind released Nano Banana 2 Lite (gemini-3.1-flash-lite-image), its fastest image generation model with 4-second text-to-image latency priced at $0.034 per 1K-resolution image. The company also opened developer access to Gemini Omni Flash (gemini-omni-flash-preview) for video generation and editing at $0.10 per second of output.

Nano Banana 2 Lite targets high-throughput developer workflows where speed and cost matter more than maximum quality. According to Google, it's positioned as the recommended replacement for the original Nano Banana (gemini-2.5-flash-image).

Nano Banana 2 Lite specifications

  • Latency: 4 seconds for text-to-image generation
  • Pricing: $0.034 per 1K-resolution image
  • API name: gemini-3.1-flash-lite-image
  • Availability: Google AI Studio, Gemini API, Gemini Enterprise Agent Platform
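To put the quoted figures in context, here is a minimal back-of-the-envelope sketch of what a batch would cost and roughly how long it would take. The function name `estimate_image_batch` and the `concurrency` parameter are hypothetical, and the sketch assumes every request takes exactly the quoted 4 seconds; real latency, rate limits, and billing may differ.

```python
from decimal import Decimal

# Figures quoted in the article; treat them as launch-time list values.
PRICE_PER_IMAGE_USD = Decimal("0.034")  # per 1K-resolution image
LATENCY_SECONDS = 4                     # quoted text-to-image latency


def estimate_image_batch(num_images: int, concurrency: int = 1) -> tuple[Decimal, int]:
    """Return (total cost in USD, rough wall-clock seconds) for a batch.

    Assumption: each request takes the quoted 4 seconds and `concurrency`
    requests run in parallel with no rate limiting.
    """
    if num_images < 0:
        raise ValueError("num_images must be non-negative")
    if concurrency < 1:
        raise ValueError("concurrency must be at least 1")
    cost = PRICE_PER_IMAGE_USD * num_images
    waves = -(-num_images // concurrency)  # ceiling division
    return cost, waves * LATENCY_SECONDS


if __name__ == "__main__":
    cost, seconds = estimate_image_batch(1000)
    print(f"1,000 images: ${cost:.2f}, about {seconds} s sequentially")
```

Under these assumptions, 1,000 images cost $34.00 and take about 4,000 seconds when generated one at a time.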

Google claims the model maintains "reliable prompt adherence, strong character consistency and legible in-image text rendering" despite optimizations for speed. The company published internal benchmarks comparing Elo quality scores, latency, and cost against competitor models, though independent verification is not yet available.

The Nano Banana family now includes four models:

  • Nano Banana 2 Lite (Gemini 3.1 Flash Lite Image): Speed-optimized
  • Nano Banana 2 (Gemini 3.1 Flash Image): Balanced performance
  • Nano Banana Pro (Gemini 3 Pro Image): Quality-optimized for professional use
  • Nano Banana (Gemini 2.5 Flash Image): Legacy model, upgrade recommended
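For teams that select a model by priority, the lineup above can be sketched as a small lookup table. The `ImageModel` type and `pick_model` helper are hypothetical names used for illustration only. API names are filled in only where the article states them; the other two are left as `None` rather than guessed.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class ImageModel:
    name: str
    base_model: str
    role: str
    api_id: Optional[str]  # None where the article does not give an API name


# Keys ("speed", "balanced", ...) are illustrative labels, not official tiers.
NANO_BANANA_FAMILY = {
    "speed": ImageModel("Nano Banana 2 Lite", "Gemini 3.1 Flash Lite Image",
                        "Speed-optimized", "gemini-3.1-flash-lite-image"),
    "balanced": ImageModel("Nano Banana 2", "Gemini 3.1 Flash Image",
                           "Balanced performance", None),
    "quality": ImageModel("Nano Banana Pro", "Gemini 3 Pro Image",
                          "Quality-optimized for professional use", None),
    "legacy": ImageModel("Nano Banana", "Gemini 2.5 Flash Image",
                         "Legacy model, upgrade recommended",
                         "gemini-2.5-flash-image"),
}


def pick_model(priority: str) -> ImageModel:
    """Look up a family member by an illustrative priority label."""
    try:
        return NANO_BANANA_FAMILY[priority]
    except KeyError:
        raise ValueError(f"unknown priority: {priority!r}") from None


if __name__ == "__main__":
    print(pick_model("speed").api_id)
```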

Gemini Omni Flash video capabilities

Gemini Omni Flash, first introduced at Google I/O, is now accessible via API in public preview. The model handles video generation and editing from text, image, and video inputs with conversational refinement.

Key specifications:

  • Pricing: $0.10 per second of video output (matching Veo 3.1 Fast)
  • API name: gemini-omni-flash-preview
  • Current output length: 10 seconds (longer durations coming)
  • Availability: Google AI Studio, Gemini API
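The per-second pricing makes clip costs easy to estimate. The sketch below is illustrative only: `estimate_video_cost` is a hypothetical helper, and it assumes the 10-second figure acts as an upper bound on a single clip and that billing is strictly linear in output seconds.

```python
from decimal import Decimal

# Figures quoted in the article for the public preview.
PRICE_PER_OUTPUT_SECOND_USD = Decimal("0.10")
MAX_CLIP_SECONDS = 10  # assumption: treated here as a per-clip cap


def estimate_video_cost(clip_seconds: int, num_clips: int = 1) -> Decimal:
    """Return the estimated USD cost of `num_clips` clips of `clip_seconds` each."""
    if not 0 < clip_seconds <= MAX_CLIP_SECONDS:
        raise ValueError(f"clip_seconds must be between 1 and {MAX_CLIP_SECONDS}")
    if num_clips < 0:
        raise ValueError("num_clips must be non-negative")
    return PRICE_PER_OUTPUT_SECOND_USD * clip_seconds * num_clips


if __name__ == "__main__":
    print(f"One 10-second clip: ${estimate_video_cost(10):.2f}")
    print(f"Fifty 10-second clips: ${estimate_video_cost(10, 50):.2f}")
```

Under these assumptions, a single 10-second clip costs $1.00 and fifty of them cost $50.00.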

Current limitations:

  • Video references up to 3 seconds accepted but not correctly processed
  • Audio reference uploads not yet supported
  • Character consistency issues across scene changes
  • Scene extension not available

The model uses Gemini's multimodal reasoning to synchronize text, graphics, and actions in generated video. Google positions it for conversational editing workflows where users iteratively refine outputs through natural language.

Combined workflow capabilities

Google published three demo applications showing Nano Banana 2 Lite generating initial images that Gemini Omni Flash then animates:

  • Anywhere: Generates landmark backgrounds, animates into video clips
  • Space Lift: Interior design visualization with cinematic walkthroughs
  • Omni Product Studio: E-commerce product video generation

The Interactions API supports up to three sequential edits with maintained session context. Both models include SynthID watermarking for content provenance.
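As a rough guide to what one pass through this image-to-video pipeline might cost, the sketch below combines the two quoted prices. It is not based on the Interactions API itself: `estimate_pipeline_cost` is a hypothetical helper, and it assumes that each edit re-renders and bills a full 10-second clip, which the article does not confirm.

```python
from decimal import Decimal

# Figures quoted in the article.
IMAGE_PRICE_USD = Decimal("0.034")            # one 1K-resolution image
VIDEO_PRICE_PER_SECOND_USD = Decimal("0.10")  # per second of video output
CLIP_SECONDS = 10                             # current output length
MAX_SEQUENTIAL_EDITS = 3                      # edits per session, per the article


def estimate_pipeline_cost(edits: int = 0) -> Decimal:
    """Estimate USD cost of one image, one clip, and `edits` follow-up edits.

    Assumption: every edit is billed as a full re-render of the clip.
    """
    if not 0 <= edits <= MAX_SEQUENTIAL_EDITS:
        raise ValueError(f"edits must be between 0 and {MAX_SEQUENTIAL_EDITS}")
    clip_cost = VIDEO_PRICE_PER_SECOND_USD * CLIP_SECONDS
    renders = 1 + edits  # the initial clip plus one render per edit
    return IMAGE_PRICE_USD + clip_cost * renders


if __name__ == "__main__":
    print(f"No edits: ${estimate_pipeline_cost():.3f}")
    print(f"Three edits: ${estimate_pipeline_cost(3):.3f}")
```

Under these assumptions, an unedited image-plus-clip run costs $1.034, and a session that uses all three edits costs $4.034. The video step dominates the total in both cases.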

What this means

Google is pricing Nano Banana 2 Lite aggressively to compete in the high-volume image generation market, undercutting alternatives where speed matters more than quality. At $0.034 per 1K-resolution image and 4-second latency, the model targets prototyping and draft-heavy workflows. However, Google's benchmarks are internal and not yet independently verified, so developers should validate the quality claims against their own use cases.

The opening of the Gemini Omni Flash API is the more significant of the two announcements. At $0.10 per second, it's the first major video generation model with conversational editing officially available via API, though the 10-second output limit and the known processing bugs (especially around video references) show that it is still at preview stage. The combined image-to-video pipeline could enable new product categories in e-commerce, marketing, and content creation, provided quality and reliability hold up at scale.
