Google launches Gemini Omni Flash, multimodal video generation model available to AI Plus subscribers
Google has released Gemini Omni Flash, the first model in its new Gemini Omni family, which generates video from text, image, video, and audio inputs. The model is available now to subscribers on AI Plus and higher tiers, with free access coming to YouTube Shorts and YouTube Create later this week.
Model capabilities
Gemini Omni accepts text, image, video, and audio inputs (audio is currently limited to speech samples) and combines them into a single video output. According to Google, the model maintains a "cohesive, grounded world" with realistic physics and sound effects. Users can refine a generated video in follow-up turns.
Google demonstrated the model generating a video of a rolling marble with what the company claims are believable physics for ball movement and convincing sound effects for bounces and bell rings. Another demo showed a claymation-style explainer video about protein folding.
Availability and pricing
Gemini Omni Flash is available immediately to subscribers on AI Plus and higher tiers. Pricing for the AI Plus subscription was not disclosed. Free access through YouTube Shorts and YouTube Create will launch later this week.
Google teased a higher-tier "Omni Pro" model with details coming soon.
Safety measures
All videos created by Gemini Omni include SynthID watermarking to identify AI-generated content. The model allows users to create personalized avatars of themselves for video generation. Audio and speech editing capabilities are not yet enabled; Google stated it is holding them back "until [it] can bring this capability to users responsibly."
Context
The release builds on Google's previous work with the Genie model for interactive video-game-style experiences and its Veo video generation models. Unlike Genie, which remains limited to AI Ultra subscribers, Google is positioning the Omni series for broader access.
What this means
Google is entering direct competition with OpenAI's Sora in consumer video generation, choosing broader distribution through free YouTube integration rather than a paid-only offering. The decision to withhold audio editing at launch, together with mandatory watermarking, suggests the company is prioritizing safety controls over feature completeness. The tiered model structure (Flash and Pro) mirrors Google's strategy across its Gemini lineup, though concrete capability differences and Pro pricing remain undisclosed.