product update

Google preps Gemini agent for macOS to control computers and organize files, challenging Claude Cowork

TL;DR

Google is developing a Gemini agent for macOS that will control computers, organize files, and integrate with Google Workspace apps. Code analysis reveals features including file conversion to Google Sheets, folder organization, batch file renaming, and meeting follow-up automation.

2 min read
0

Google Preps Gemini Agent for macOS to Control Computers and Organize Files

Google is developing a Gemini agent for macOS with computer control capabilities to rival Anthropic's Claude Cowork, according to code analysis by 9to5Google's APK Insight team.

The agent will use macOS Screen Access and Accessibility features to control mouse and keyboard inputs, similar to Claude Cowork's computer use functionality released in recent months.

Four Core Capabilities Revealed

Code strings reveal four example prompts that define the agent's initial scope:

  1. Convert files to spreadsheets: Scan local folders for invoices or reports, extract data, and structure it into Google Sheets
  2. Organize folders: Find unorganized files in Desktop or Downloads, group by type or context, and archive
  3. Standardize files: Read file metadata and batch-rename files into organized subfolders
  4. Meeting follow-up: Extract Meet transcripts or Docs notes from recent meetings and draft follow-up emails with highlights and action items

The first three capabilities focus on local file management and Google Workspace integration. The fourth shifts focus to Google's productivity suite including Meet, Docs, and Gmail.

Current Gemini macOS App Limited

The existing Gemini macOS app offers two features: a native chat interface and an alt-space shortcut that allows sharing the current window for visual context. The upcoming agent capabilities would represent a significant expansion.

Broader Than Android Implementation

The macOS agent appears positioned for broader capabilities than Gemini's Android implementation. Currently, only select Android devices like the Galaxy S26 series can automate simple in-app tasks such as food ordering.

Google previously demonstrated computer control capabilities in the Gemini 2.5 Computer Use preview released last year, but has not yet shipped production features.

What This Means

Google is playing catch-up to Anthropic's Claude Cowork in the computer control agent space, focusing initially on file management and Google Workspace integration rather than general-purpose automation. The emphasis on Google Workspace suggests the company plans to leverage its productivity suite as a competitive advantage, potentially making the agent more valuable for organizations already committed to Google's ecosystem. No release timeline has been announced.

Comments

Loading...