Google releases Gemini 3.5 Flash at half the price of frontier models, announces Omni world model
Google released Gemini 3.5 Flash, priced at half to one-third the cost of comparable frontier models, and announced it will become the default model in the Gemini app globally. The company also unveiled Omni, a world model for simulating physical environments, and Gemini Spark, an AI agent in beta testing.
Gemini 3.5 Flash — Quick Specs
Google releases Gemini 3.5 Flash at half the price of frontier models, announces Omni world model
Google released Gemini 3.5 Flash at its I/O developer conference on May 19, 2026, positioning the model at half to one-third the price of comparable frontier models according to CEO Sundar Pichai. The model will immediately become the default in the Gemini app and AI mode in search globally.
Pricing and performance
Google claims Gemini 3.5 Flash delivers "cutting-edge capabilities" at 50-67% lower cost than competing frontier models. Specific pricing per million tokens was not disclosed. Pichai described the model as "remarkably fast" in a pre-event briefing, with Google stating users "no longer have to trade quality for latency."
The company said it strengthened cybersecurity defenses to make 3.5 Flash "less likely to generate harmful content and mistakenly refuse to answer safe queries." No benchmark scores were provided.
Gemini 3.5 Pro delayed
Gemini 3.5 Pro, the heavier-weight version, is currently in internal use but won't launch publicly until June 2026. Google did not provide technical specifications or performance comparisons for the Pro version.
Omni world model for physical simulation
Google announced Omni, a world model designed to simulate physical environments and predict outcomes based on user actions. The model supports image and audio input and will be integrated into Flash, the Gemini App, Google Flow, and YouTube Shorts.
According to Google, Omni can "edit the action, add in new characters or objects" in user-uploaded videos. World models are primarily used in robotics and gaming applications. DeepMind has researched world models extensively, though Google did not specify how much of that research informed Omni's development.
Gemini Spark agent in beta
Google introduced Gemini Spark, an AI agent that can "reason across information in connected apps" and take actions on behalf of users. The agent launches in beta next week for trusted testers and Google AI Ultra subscribers only. No timeline for general availability was provided.
The agent represents Google's push into agentic AI as the company attempts to create deeper integrations across its product suite.
Market context
The announcements come as OpenAI and Anthropic, both reportedly preparing for IPOs in 2026, dominate AI market attention. Anthropic recently released its Mythos model, which the company claims discovered thousands of previously unknown software vulnerabilities.
Google's capital expenditure on AI infrastructure has increased significantly, with Wall Street expecting the company to demonstrate return on investment through product integrations.
What this means
Google's aggressive pricing on Gemini 3.5 Flash directly challenges OpenAI and Anthropic on cost, a critical factor for developers building at scale. The lack of disclosed benchmarks makes it impossible to verify performance claims against GPT-4 or Claude 3.5 Sonnet. Omni's world model capabilities could differentiate Google in robotics and video editing applications, but commercial viability depends on accuracy and reliability metrics not yet public. The staggered rollout of Spark and delayed Pro model suggest Google is still validating core capabilities before broad deployment.
Related Articles
Google releases Gemini 3.1 Flash Lite Image, its fastest and cheapest image generation model
Google has released Gemini 3.1 Flash Lite Image, also called Nano Banana 2 Lite, which the company describes as its fastest and cheapest image generation model. The model is available through Google's AI Studio and Gemini API with the identifier gemini-3.1-flash-lite-image.
Claude Sonnet 5 ships with 1M token context and new tokenizer that increases costs 30-40% for English text
Anthropic released Claude Sonnet 5 with a 1 million token context window and 128,000 token maximum output. The model removes traditional sampling parameters and introduces a new tokenizer that generates approximately 30% more tokens than Sonnet 4.6 for the same English text—effectively a significant price increase despite unchanged nominal rates of $3/million input and $15/million output tokens.
Google launches Nano Banana 2 Lite image model at 4 seconds per image, $0.04 per 1,000 generations
Google released Nano Banana 2 Lite, an image generation model that produces images in four seconds at under four cents per thousand images. The model prioritizes speed and cost over quality, targeting developers building high-volume image pipelines.
Google releases Nano Banana 2 Lite: 4-second image generation at $0.034 per 1,000 images
Google released Nano Banana 2 Lite, an AI image generator that produces images in 4 seconds and costs $0.034 per 1,000 images. The model is optimized for high-volume workflows and replaces the original Nano Banana as Google's entry-level image generation offering.
Comments
Loading...