OpenAI's Codex Desktop adds computer control and browser automation beyond coding
OpenAI's Codex Desktop can now control your computer, run background automations, and includes an in-app browser with click-to-select elements. The update adds automation memory across sessions and access to over 100 curated plugins, though the computer control feature is MacOS-only and unavailable in the EU.
OpenAI's Codex Desktop adds computer control and browser automation beyond coding
OpenAI announced a significant expansion of Codex Desktop, adding computer control capabilities, an in-app browser, and persistent automation memory. The update positions the tool beyond pure code generation, though OpenAI confirms it remains targeted at developers.
Computer control limited to MacOS
Codex Desktop can now operate applications in the background while users work in other programs. The computer control feature is MacOS-only at launch and not available in the European Union. No Windows or Linux support timeline was provided.
Browser automation with click-to-select
The update includes an in-app browser with the ability to click elements directly rather than describe their location. Users can select a specific page element and issue commands like "change this to that" without explaining positioning.
Automation memory and task continuity
Codex Desktop now maintains memory across sessions, addressing what OpenAI acknowledged was a "particularly problematic" limitation requiring the app to be brought up to speed on every relaunch. The system can "wake up automatically to continue on a long-term task, potentially across days or weeks," according to OpenAI.
Automations can be added to existing conversational threads, preserving context from earlier interactions. The app proactively suggests continuing previous workflows on launch.
Image generation and developer features
Codex Desktop adds ChatGPT's image generation capabilities for automated chart and diagram creation within workflows. Developer-focused updates include:
- GitHub review comment support
- Multiple terminal tabs (no color-coding yet)
- SSH connections to remote devboxes (early testing)
- File previews for PDFs, spreadsheets, slides, and documents
- Summary pane tracking agent plans and sources
100+ curated plugins
OpenAI ships Codex Desktop with access to over 100 plugins combining skills, app integrations, and MCP servers. The company curates plugins before availability, differentiating from Anthropic's Claude approach which faced malware issues from user-contributed skills.
Availability and usage
Codex Desktop is available to any OpenAI tier with Codex access on Mac and Windows. Computer control remains MacOS-exclusive. OpenAI warns that running extended automations will consume token allocations faster.
According to OpenAI, 80% of its staff use Codex, including non-programmers, though the company maintains the primary target remains developers.
What this means
Codex Desktop's expansion into computer control puts OpenAI in direct competition with Anthropic's Claude and other AI automation tools. The MacOS-only limitation for the headline feature suggests technical or safety constraints OpenAI hasn't resolved for other platforms. The EU restriction likely reflects regulatory concerns around autonomous computer control. The real test will be reliability—click-to-select and multi-day automations are impressive in demos but need to work consistently in production environments.
Related Articles
GitHub Copilot switches to token-based billing June 1, some users report costs jumping from $50 to $3,000
Microsoft is ending GitHub Copilot's flat-rate subscription model in favor of token-based billing starting June 1. Some developers report monthly costs rising from approximately $29-50 to $750-3,000, while others claim the increases only affect inefficient "vibe-coders" who iterate excessively without clear direction.
OpenAI's Codex for Windows gains Computer Use and remote control from ChatGPT mobile apps
OpenAI has expanded its Codex desktop app to Windows with Computer Use capabilities and remote control from ChatGPT mobile apps. The features, previously Mac-only, allow Codex to operate Windows desktop applications autonomously and enable iPhone, iPad, and Android users to initiate and monitor Codex tasks on Windows devices.
Google launches Gemini Spark AI agent for Ultra subscribers in US with automated task execution
Google has launched Gemini Spark, a 24/7 AI agent for Google AI Ultra subscribers in the US. The service automates tasks across Google Workspace apps with remote browser control, supporting up to 15 concurrent tasks with compute-based usage limits.
Microsoft 365 Copilot gains 2x faster load times and progressive disclosure interface
Microsoft is rolling out a redesigned Microsoft 365 Copilot that loads twice as fast, according to the company. The update introduces "progressive disclosure" — showing tools and controls contextually based on prompts rather than displaying all options at once.
Comments
Loading...