Google releases Gemini 3.5 Flash with 4x faster output and agentic capabilities, 3.5 Pro coming June
Google released Gemini 3.5 Flash today with 4x faster output token generation than competing frontier models while surpassing Gemini 3.1 Pro on coding, agentic, and multimodal benchmarks. The company announced Gemini 3.5 Pro will launch next month and introduced Gemini Omni, a new multimodal series that outputs video.
Google releases Gemini 3.5 Flash with 4x faster output and agentic capabilities, 3.5 Pro coming June
Google released Gemini 3.5 Flash today at I/O 2026, claiming the model delivers 4x faster output tokens per second compared to other frontier models while maintaining Flash series pricing. According to Google, the model surpasses Gemini 3.1 Pro in coding, agentic, and multimodal benchmarks.
The model is available immediately in the Gemini app, Google Search, Antigravity 2.0, and via the Gemini API. Gemini 3.5 Pro is currently in testing and will launch next month.
New Gemini Omni multimodal series
Google introduced Gemini Omni, a new model family combining reasoning with content creation. Gemini Omni Flash accepts image, audio, video, and text inputs and outputs video "grounded in real-world knowledge" that can be edited. The model is rolling out to AI Plus, Pro, and Ultra subscribers in the Gemini app, Google Flow, and YouTube Shorts.
The company also launched Google Flow and Flow Music as standalone mobile apps, with Flow available on Android in beta and Flow Music launching on iOS first.
Gemini Spark agentic assistant
Google announced Gemini Spark, described as "your personal agent" that performs tasks autonomously. The system integrates with Gmail, Docs, and other Google Workspace apps, with third-party tool support via MCP (Model Context Protocol) coming this summer.
Gemini Spark will be available next week exclusively to Google AI Ultra subscribers in the US. The service transforms Gemini "from an assistant that can answer your questions into an active partner that does real work on your behalf," according to Google.
Pricing changes for Google AI plans
Google restructured its subscription tiers. AI Ultra now starts at $100 per month (previously $250), offering 5x higher usage limits than AI Pro. A new $200 tier replaces the previous $250 plan with identical capabilities.
The company is shifting from daily prompt limits to a "compute-used" model that accounts for prompt complexity, features used, and chat length. Limits refresh every five hours until reaching a weekly cap.
Search and productivity updates
AI Mode in Google Search now runs on Gemini 3.5 Flash. New "information agents" will monitor the web 24/7 for topics users specify, available to AI Pro and Ultra subscribers this summer. Google Search will also gain the ability to build custom dashboards and trackers for ongoing tasks.
Gmail Live, a conversational email search feature, rolls out to AI Pro and Ultra subscribers in the US this summer on Android and iOS. Docs Live for conversational document creation and editing launches simultaneously for the same subscriber tiers.
What this means
Google's 4x speed claim for Gemini 3.5 Flash positions it directly against Anthropic's Claude 3.5 Sonnet and OpenAI's GPT-4o in the fast inference tier. The introduction of Gemini Omni's video output capabilities represents a significant multimodal expansion, though real-world quality benchmarks remain to be seen. The restructured pricing and compute-based limits suggest Google is attempting to balance access with infrastructure costs as model capabilities increase. The MCP integration for Gemini Spark indicates Google is adopting industry standards for agent interoperability rather than building a proprietary ecosystem.
Related Articles
Google releases Gemini 3.5 Flash with autonomous coding and agent capabilities, claims 4x speed boost
Google released Gemini 3.5 Flash, positioning it as an agent-first model designed for autonomous coding and multi-hour workflows. The company claims the model outperforms its 3.1 Pro predecessor on coding and agentic benchmarks while running 4x faster than competing frontier models, with an optimized version achieving 12x speed gains.
Google releases Gemini 3.5 Flash at half the price of frontier models, announces Omni world model
Google released Gemini 3.5 Flash, priced at half to one-third the cost of comparable frontier models, and announced it will become the default model in the Gemini app globally. The company also unveiled Omni, a world model for simulating physical environments, and Gemini Spark, an AI agent in beta testing.
Google releases Gemini Omni Flash video generation model with conversational editing, withholds speech synthesis
Google DeepMind released Gemini Omni Flash, the first model in its new Omni family that generates and edits video from image, audio, video, and text inputs. The model is rolling out to Gemini app subscribers and YouTube Shorts with a 10-second clip limit, while speech-editing capabilities remain withheld pending safety testing.
Google Releases Gemini 3.5 Flash with 1M Token Context and Configurable Thinking Modes at $1.50/$9 Per Million Tokens
Google has released Gemini 3.5 Flash, a multimodal model with a 1 million token context window priced at $1.50 per million input tokens and $9 per million output tokens. The model supports text, image, video, audio, and PDF inputs with configurable thinking effort levels from minimal to high.
Comments
Loading...