model release

Google launches Gemini Omni Flash, multimodal video generation model available to AI Plus subscribers

TL;DR

Google has released Gemini Omni Flash, the first model in its new Gemini Omni family designed to generate video content from text, images, video, and audio inputs. The model is available now to AI Plus subscribers, with free access coming to YouTube Shorts and YouTube Create later this week.

Google has released Gemini Omni Flash, the first model in its new Gemini Omni family designed to generate video content from multiple input types. The model is available now to subscribers of AI Plus and higher tiers, with free access coming to YouTube Shorts and YouTube Create later this week.

Model capabilities

Gemini Omni accepts text, image, video, and audio inputs (audio is currently limited to speech samples) and combines them into a single video output. According to Google, the model maintains a "cohesive, grounded world" with realistic physics and sound effects. Users can refine a generated video in follow-up turns.

Google demonstrated the model generating a video of a rolling marble with what the company claims are believable physics for ball movement and convincing sound effects for bounces and bell rings. Another demo showed a claymation-style explainer video about protein folding.

Availability and pricing

Gemini Omni Flash is available immediately to subscribers on AI Plus and higher tiers. Pricing for the AI Plus subscription was not disclosed. Free access through YouTube Shorts and YouTube Create will launch later this week.

Google teased a higher-tier "Omni Pro" model with details coming soon.

Safety measures

All videos created by Gemini Omni include SynthID watermarking to identify AI-generated content. The model allows users to create personalized avatars of themselves for video generation. Audio and speech editing capabilities are not yet enabled; Google said it is holding them back "until [it] can bring this capability to users responsibly."

Context

The release builds on Google's previous work with the Genie model for interactive video-game-style experiences and its Veo video generation models. Unlike Genie, which remains limited to AI Ultra subscribers, Google is positioning the Omni series for broader access.

What this means

Google is entering direct competition with OpenAI's Sora in consumer video generation, choosing broader distribution through free YouTube integration rather than paid-only access. Withholding audio editing at launch and making watermarking mandatory suggest the company is prioritizing safety controls over feature completeness. The tiered model structure (Flash and Pro) mirrors Google's strategy across its Gemini lineup, though concrete capability differences and Pro pricing remain undisclosed.

