computer-use
6 articles tagged with computer-use
Google adds screen selection tool to Chrome's Gemini panel, integrates computer use into Gemini 3.5 Flash API
Google has added a screen selection tool to the Gemini panel in Chrome 149, letting users capture text or images from their current tab to use in prompts. Separately, the company integrated computer use capabilities directly into the Gemini 3.5 Flash model API, replacing the standalone Gemini 2.5 Computer Use model.
H Company Ships Holo3.1 with Local Inference, Mobile Support, and 79.3% AndroidWorld Score
H Company released Holo3.1, a computer-use agent model family ranging from 0.8B to 35B parameters. The 35B-A3B variant scores 79.3% on AndroidWorld, up from 67% in Holo3. For the first time, H Company ships quantized checkpoints (FP8, Q4 GGUF, NVFP4) enabling local inference with 1.74× throughput gains and sub-4-second agent step times.
OpenAI's Codex for Windows gains Computer Use and remote control from ChatGPT mobile apps
OpenAI has expanded its Codex desktop app to Windows with Computer Use capabilities and remote control from ChatGPT mobile apps. The features, previously Mac-only, allow Codex to operate Windows desktop applications autonomously and enable iPhone, iPad, and Android users to initiate and monitor Codex tasks on Windows devices.
Holo3 achieves 78.85% on OSWorld benchmark with only 10B active parameters
H Company unveiled Holo3, a computer use model that scores 78.85% on OSWorld-Verified, the highest score on that leading desktop automation benchmark. The model achieves this with only 10B active parameters (122B total), positioning it as a lower-cost alternative to proprietary models like GPT-5.4 and Opus 4.6.
Anthropic releases Claude computer use feature to compete with OpenClaw
Anthropic announced Monday that Claude can now complete tasks on users' computers, including opening apps, navigating browsers, and filling spreadsheets, after receiving prompts from a smartphone. The feature positions Anthropic directly against OpenClaw, the viral AI agent that went mainstream this year. The capability comes with safeguards requiring Claude to request permission before accessing new applications.
OpenAI launches GPT-5.4 with native computer use capabilities for autonomous agents
OpenAI has launched GPT-5.4, its latest model with native computer use capabilities that allow it to operate computers and complete tasks across applications. The release represents a step toward autonomous AI agents that can handle complex jobs independently. The model includes advancements in reasoning, coding, and professional work with spreadsheets, documents, and presentations.