product update

Google adds screen selection tool to Chrome's Gemini panel, integrates computer use into Gemini 3.5 Flash API

TL;DR

Google has added a screen selection tool to Chrome 149's Gemini panel that allows users to capture text or images from their current tab for prompts. Separately, the company integrated computer use capabilities directly into the Gemini 3.5 Flash model API, replacing the standalone Gemini 2.5 Computer Use model.

Chrome feature: Select from screen

The "Select from screen" tool appears in the Gemini panel's plus menu in Chrome 149. When activated, it highlights the current browser tab and prompts users to "Select any text or image to ask Gemini." The selected content is automatically added to the prompt box.

The feature is rolling out now with Chrome 149. Users who don't see it immediately can restart their browser to trigger the update.

Gemini 3.5 Flash gains native computer use

Google announced that Gemini 3.5 Flash now includes built-in computer use capabilities through the Gemini API. This native integration replaces the separate Gemini 2.5 Computer Use model that was previously available.

According to Google, developers can use the functionality to "build custom agents that can see, reason and take action across browser, mobile and desktop environments." The company claims improved performance for long-horizon and enterprise automation tasks, including continuous software testing and knowledge work across professional applications.

Google provided an example where 3.5 Flash uses computer use to "analyze the Gemini app and return a categorized list of features."
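
The "see, reason and take action" cycle Google describes can be illustrated with a generic agent loop. This is only a minimal sketch under stated assumptions, not the Gemini API: the `Action` shape, the `run_agent` function, and the three callables (`capture_screen`, `propose_action`, `execute`) are hypothetical names introduced for illustration, and a real integration would replace `propose_action` with a call to the model.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class Action:
    """One UI step proposed by the model (hypothetical shape, not the Gemini API schema)."""
    kind: str          # e.g. "click", "type", or "done"
    target: str = ""   # element the step applies to
    text: str = ""     # text to enter, if any


@dataclass
class AgentRun:
    """Record of what the loop executed and whether the model signalled completion."""
    steps: List[Action] = field(default_factory=list)
    finished: bool = False


def run_agent(
    capture_screen: Callable[[], bytes],
    propose_action: Callable[[str, bytes, List[Action]], Action],
    execute: Callable[[Action], None],
    goal: str,
    max_steps: int = 20,
) -> AgentRun:
    """Observe the screen, ask the model for the next step, act, and repeat."""
    run = AgentRun()
    for _ in range(max_steps):
        screenshot = capture_screen()                          # see
        action = propose_action(goal, screenshot, run.steps)   # reason
        if action.kind == "done":
            run.finished = True
            break
        execute(action)                                        # act
        run.steps.append(action)
    return run
```

The `max_steps` cap is a design choice for this sketch: long-horizon tasks need some bound so that a model that never returns "done" cannot loop indefinitely.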

Safety controls for enterprise

Google has implemented the following safety controls for enterprise customers:

  • Ability to require explicit user confirmation for sensitive or irreversible actions
  • Automatic task termination if an indirect prompt injection is detected
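
The two controls above can be sketched as a gate that every proposed action passes through before it runs. This is an illustrative sketch only: the article does not say how the Gemini API surfaces these signals, so the `sensitive` and `injection_suspected` flags, the `gate` and `run_task` functions, and the `Verdict` values are all assumptions made for the example.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Callable, Iterable


class Verdict(Enum):
    EXECUTE = "execute"        # safe to run
    SKIPPED = "skipped"        # user declined a sensitive action
    TERMINATED = "terminated"  # task must stop immediately


@dataclass
class ProposedAction:
    """Hypothetical action record; the flag names are assumptions, not API fields."""
    kind: str
    description: str
    sensitive: bool = False            # e.g. a purchase or a deletion
    injection_suspected: bool = False  # e.g. set by an upstream detector


def gate(action: ProposedAction, confirm: Callable[[ProposedAction], bool]) -> Verdict:
    """Decide what to do with a single proposed action."""
    if action.injection_suspected:
        return Verdict.TERMINATED
    if action.sensitive and not confirm(action):
        return Verdict.SKIPPED
    return Verdict.EXECUTE


def run_task(
    actions: Iterable[ProposedAction],
    confirm: Callable[[ProposedAction], bool],
    execute: Callable[[ProposedAction], None],
) -> bool:
    """Run actions in order; return False if the task was terminated early."""
    for action in actions:
        verdict = gate(action, confirm)
        if verdict is Verdict.TERMINATED:
            return False
        if verdict is Verdict.EXECUTE:
            execute(action)
    return True
```

The injection check runs before the confirmation check so that a suspected injection ends the task without ever prompting the user to approve the injected step.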

The computer use capabilities join existing Search and Maps grounding features in the Gemini API.

Availability

Gemini 3.5 Flash with computer use is available today through:

  • The Gemini API
  • A demo environment hosted by Browserbase
  • Documentation and reference implementation via the Gemini Enterprise Agent Platform

Pricing for the computer use capabilities was not disclosed.

What this means

The Chrome screen selection tool streamlines multimodal prompting by removing the need to take a screenshot and upload it manually. Folding computer use into Gemini 3.5 Flash consolidates Google's agentic capabilities in its primary fast model, which suggests the company views browser and desktop automation as a core feature rather than a specialized use case. The safety controls for prompt injection and user confirmation suggest Google is responding to enterprise concerns about agents acting autonomously.
