model release

Google launches Gemini 3.1 Flash Lite Image with 4-second generation time, $0.25 per 1M input tokens

TL;DR

Google has released Gemini 3.1 Flash Lite Image, a text-to-image model that generates 1K resolution images in approximately 4 seconds — 2.7× faster than Gemini 3.1 Flash Image. The model is priced at $0.25 per 1M input tokens and $1.50 per 1M output tokens, with a 66K context window and knowledge cutoff of January 2025.

2 min read
0

Google launches Gemini 3.1 Flash Lite Image with 4-second generation time

Google has released Gemini 3.1 Flash Lite Image, marketed internally as "Nano Banana 2 Lite," a text-to-image model designed for high-velocity developer pipelines. According to Google, the model generates images in approximately 4 seconds, making it 2.7× faster than Gemini 3.1 Flash Image.

Pricing and specifications

  • Input cost: $0.25 per 1M tokens
  • Output cost: $1.50 per 1M tokens
  • Context window: 66K tokens
  • Knowledge cutoff: January 2025
  • Release date: June 30, 2026 (according to listing)
  • Output resolution: 1K across 14 aspect ratios

Core capabilities

The model supports three primary functions through a single API endpoint: text-to-image generation, image editing, and multi-image composition. As a multimodal model, it returns both text and image outputs.

Google claims the model maintains "character consistency, precise editing, and real-world knowledge" from the Nano Banana family while optimizing for speed and cost efficiency.

All outputs include an invisible SynthID watermark for AI-generated content identification.

Technical positioning

Google positions this model as the "best balance of quality and speed" in the Nano Banana 2 line. The company states it's designed for prototyping, real-time applications, and visual workflows that require generating thousands of images at scale.

The model is available through OpenRouter's API infrastructure, which routes requests across multiple hosting providers based on price, speed, or specific provider requirements. OpenRouter's data indicates customers achieve 60-80% lower effective costs than list prices when using prompt caching with repeated context.

What this means

The 4-second generation time and sub-$0.30 input pricing puts this firmly in the rapid prototyping and high-volume application space. The 66K context window is unusually large for an image model, suggesting it can handle extensive text instructions or multi-turn editing conversations without losing context. However, the June 2026 release date listed appears inconsistent with current timeline — this may be a data error or placeholder value that requires clarification from Google.

Related Articles

model release

Google launches Nano Banana 2 Lite image model at 4 seconds per image, $0.04 per 1,000 generations

Google released Nano Banana 2 Lite, an image generation model that produces images in four seconds at under four cents per thousand images. The model prioritizes speed and cost over quality, targeting developers building high-volume image pipelines.

model release

Google releases Nano Banana 2 Lite: 4-second image generation at $0.034 per 1,000 images

Google released Nano Banana 2 Lite, an AI image generator that produces images in 4 seconds and costs $0.034 per 1,000 images. The model is optimized for high-volume workflows and replaces the original Nano Banana as Google's entry-level image generation offering.

model release

Google DeepMind releases Nano Banana 2 Lite at $0.034 per 1K image with 4-second generation, opens Gemini Omni Flash API

Google DeepMind released Nano Banana 2 Lite (gemini-3.1-flash-lite-image), its fastest image generation model with 4-second text-to-image latency priced at $0.034 per 1K-resolution image. The company also opened developer access to Gemini Omni Flash (gemini-omni-flash-preview) for video generation and editing at $0.10 per second of output.

model release

Claude Sonnet 5 ships with 1M token context and new tokenizer that increases costs 30-40% for English text

Anthropic released Claude Sonnet 5 with a 1 million token context window and 128,000 token maximum output. The model removes traditional sampling parameters and introduces a new tokenizer that generates approximately 30% more tokens than Sonnet 4.6 for the same English text—effectively a significant price increase despite unchanged nominal rates of $3/million input and $15/million output tokens.

Comments

Loading...