model release

Google launches Gemini 3.5 Flash and new Omni multimodal AI family at I/O 2026

TL;DR

Google launched Gemini 3.5 Flash today as the default model for its Gemini app and AI Mode in Search, with Gemini 3.5 Pro following next month. The company also introduced Gemini Omni, a new multimodal AI family capable of generating video from text, photos, video, and audio inputs.

2 min read
0

Google launches Gemini 3.5 Flash and new Omni multimodal AI family at I/O 2026

Google released Gemini 3.5 Flash today at its I/O 2026 developer conference, making it the default model for the Gemini app and AI Mode in Search. Gemini 3.5 Pro will follow next month.

According to Google, Gemini 3.5 Flash is significantly faster than previous versions, offers improved agentic coding capabilities, and can generate "richer, more interactive web UIs and graphics." The company claims the model has improved guardrails to reduce harmful content generation and false positives on safe queries.

Gemini Omni: New multimodal family

Google introduced an entirely new AI model family called Gemini Omni. The first model, Omni Flash, is rolling out today in the Gemini app, Google Flow, and YouTube Shorts.

Unlike Google's existing Veo model (text-to-video only), Omni Flash can generate video clips from multiple input types: text, photos, video, and audio. Google says future versions will be able to "create anything from any input," though no timeline was provided.

Gemini Spark: Background AI agent

Google launched Gemini Spark, a continuously running AI agent powered by Gemini 3.5 Flash. The service runs 24/7 on virtual machines in Google Cloud and can connect to Google Workspace apps (Docs, Gmail, Sheets, Slides) and third-party services including Canva and Instacart.

Google plans to add access to local files through the Gemini app on macOS, though no release date was specified.

Android app generation in AI Studio

Google AI Studio now allows users to generate full native Android apps from prompts. The feature includes an embedded Android emulator for previewing apps, with options to export to Android Studio, GitHub, or ZIP files. Users can also test apps directly on connected phones.

Google says users will "soon" be able to publish AI-generated apps privately to friends and family. Firebase integration support is coming later.

Search and Gmail updates

Google is launching "Gmail Live," a voice-driven search interface for email that extracts specific information based on spoken queries. Similar features will extend to Google Docs and Keep.

Search is adding "information agents" that provide summarized updates on topics, plus "generative UI" that creates interactive visualizations and mini apps for repeated searches. These features launch this summer, starting with AI Pro and Ultra subscribers.

Pricing changes

Google cut AI Ultra subscription pricing from $249.99 to $100 per month, with a $200/month tier also available. The pricing change brings Google closer to OpenAI's pricing structure.

What this means

Google's launch of a dedicated Omni multimodal family signals intensifying competition in cross-modal AI generation, directly challenging OpenAI's GPT-4o and Anthropic's multimodal Claude models. The aggressive price cut for AI Ultra—dropping from $249.99 to $100/month—suggests Google is prioritizing market share over margins as enterprise AI adoption accelerates. The emphasis on agentic capabilities (Spark) and developer tools (AI Studio app generation) indicates Google is positioning for a future where AI acts autonomously rather than responding to individual prompts.

Related Articles

model release

Google launches Gemini Omni Flash, multimodal video generation model available to AI Plus subscribers

Google has released Gemini Omni Flash, the first model in its new Gemini Omni family designed to generate video content from text, images, video, and audio inputs. The model is available now to AI Plus subscribers, with free access coming to YouTube Shorts and YouTube Create later this week.

model release

Google Releases Gemini 3.5 Flash with 1M Token Context and Configurable Thinking Modes at $1.50/$9 Per Million Tokens

Google has released Gemini 3.5 Flash, a multimodal model with a 1 million token context window priced at $1.50 per million input tokens and $9 per million output tokens. The model supports text, image, video, audio, and PDF inputs with configurable thinking effort levels from minimal to high.

model release

Google releases Gemini Omni Flash video generation model with conversational editing, withholds speech synthesis

Google DeepMind released Gemini Omni Flash, the first model in its new Omni family that generates and edits video from image, audio, video, and text inputs. The model is rolling out to Gemini app subscribers and YouTube Shorts with a 10-second clip limit, while speech-editing capabilities remain withheld pending safety testing.

model release

Google releases Gemini 3.5 Flash with 4x faster output and agentic capabilities, 3.5 Pro coming June

Google released Gemini 3.5 Flash today with 4x faster output token generation than competing frontier models while surpassing Gemini 3.1 Pro on coding, agentic, and multimodal benchmarks. The company announced Gemini 3.5 Pro will launch next month and introduced Gemini Omni, a new multimodal series that outputs video.

Comments

Loading...