Google DeepMind releases Nano Banana 2 Lite at $0.034 per 1K image with 4-second generation, opens Gemini Omni Flash API
Google DeepMind released Nano Banana 2 Lite (gemini-3.1-flash-lite-image), its fastest image generation model with 4-second text-to-image latency priced at $0.034 per 1K-resolution image. The company also opened developer access to Gemini Omni Flash (gemini-omni-flash-preview) for video generation and editing at $0.10 per second of output.
Nano Banana 2 Lite (Gemini 3.1 Flash Lite Image) — Quick Specs
Google DeepMind releases Nano Banana 2 Lite at $0.034 per 1K image with 4-second generation, opens Gemini Omni Flash API at $0.10 per video second
Google DeepMind released Nano Banana 2 Lite (gemini-3.1-flash-lite-image), its fastest image generation model with 4-second text-to-image latency priced at $0.034 per 1K-resolution image. The company also opened developer access to Gemini Omni Flash (gemini-omni-flash-preview) for video generation and editing at $0.10 per second of output.
Nano Banana 2 Lite targets high-throughput developer workflows where speed and cost matter more than maximum quality. According to Google, it's positioned as the recommended replacement for the original Nano Banana (gemini-2.5-flash-image).
Nano Banana 2 Lite specifications
- Latency: 4 seconds for text-to-image generation
- Pricing: $0.034 per 1K-resolution image
- API name: gemini-3.1-flash-lite-image
- Availability: Google AI Studio, Gemini API, Gemini Enterprise Agent Platform
Google claims the model maintains "reliable prompt adherence, strong character consistency and legible in-image text rendering" despite optimizations for speed. The company published internal benchmarks comparing Elo quality scores, latency, and cost against competitor models, though independent verification is not yet available.
The Nano Banana family now includes four models:
- Nano Banana 2 Lite (Gemini 3.1 Flash Lite Image): Speed-optimized
- Nano Banana 2 (Gemini 3.1 Flash Image): Balanced performance
- Nano Banana Pro (Gemini 3 Pro Image): Quality-optimized for professional use
- Nano Banana (Gemini 2.5 Flash Image): Legacy model, upgrade recommended
Gemini Omni Flash video capabilities
Gemini Omni Flash, first introduced at Google I/O, is now accessible via API in public preview. The model handles video generation and editing from text, image, and video inputs with conversational refinement.
Key specifications:
- Pricing: $0.10 per second of video output (matching Veo 3.1 Fast)
- API name: gemini-omni-flash-preview
- Current output length: 10 seconds (longer durations coming)
- Availability: Google AI Studio, Gemini API
Current limitations:
- Video references up to 3 seconds accepted but not correctly processed
- Audio reference uploads not yet supported
- Character consistency issues across scene changes
- Scene extension not available
The model uses Gemini's multimodal reasoning to synchronize text, graphics, and actions in generated video. Google positions it for conversational editing workflows where users iteratively refine outputs through natural language.
Combined workflow capabilities
Google published three demo applications showing Nano Banana 2 Lite generating initial images that Gemini Omni Flash then animates:
- Anywhere: Generates landmark backgrounds, animates into video clips
- Space Lift: Interior design visualization with cinematic walkthroughs
- Omni Product Studio: E-commerce product video generation
The Interactions API supports up to three sequential edits with maintained session context. Both models include SynthID watermarking for content provenance.
What this means
Google is aggressively pricing Nano Banana 2 Lite to compete in the high-volume image generation market, undercutting alternatives where speed matters more than quality. The $0.034 per 1K image pricing and 4-second latency target prototyping and draft-heavy workflows. However, without independent benchmarks, developers need to validate quality claims against their specific use cases.
The Gemini Omni Flash API opening is more significant. At $0.10 per second, it's the first major video generation model with conversational editing officially available via API, though the 10-second limit and processing bugs (especially around video references) suggest it's genuinely preview-stage. The combined image-to-video pipeline could enable new product categories in e-commerce, marketing, and content creation—if the quality and reliability hold up at scale.
Related Articles
Anthropic releases Claude Sonnet 5 at $2/1M input tokens, 63.2% agentic coding benchmark
Anthropic has released Claude Sonnet 5, its new mid-tier model optimized for agentic tasks, priced at $2 per million input tokens through August 31 before rising to $3/1M. The model scores 63.2% on agentic coding benchmarks, approaching Opus 4.8's 69.2% performance at a significantly lower price point.
Anthropic releases Claude Sonnet 5 with improved agentic capabilities, $2/$10 per million tokens through August
Anthropic has released Claude Sonnet 5, replacing Sonnet 4.6 as its medium-sized model. The company claims improved agentic performance approaching Opus 4.8 levels while maintaining lower pricing at $2 per million input tokens and $10 per million output tokens through August 31.
OpenAI previews GPT-5.6 to select partners with three variants priced from $1 to $30 per million tokens
OpenAI has begun previewing its GPT-5.6 series to a limited group of trusted partners after government review. The release includes three variants: Sol at $5 input/$30 output per million tokens, Terra at $2.50/$15, and Luna at $1/$6.
OpenAI releases GPT-5.6 in three tiers with limited government-coordinated rollout
OpenAI announced GPT-5.6, a three-tier model series launching through a limited preview coordinated with the U.S. government. The models—Sol, Terra, and Luna—are priced from $1/$6 to $5/$30 per million input/output tokens and introduce new max and ultra reasoning modes.
Comments
Loading...