Augment Code launches Cosmos, an operating system for multi-agent software development workflows
Augment Code has released Cosmos into public preview, positioning it as an operating system for agentic software development. The platform coordinates AI agents across the full software development lifecycle with shared memory, multi-model routing via their Prism system that claims 20-30% token savings, and what the company calls specialized agents that learn from team feedback.
Augment Code launches Cosmos, an operating system for multi-agent software development workflows
Augment Code has released Cosmos into public preview, positioning it as an operating system for agentic software development. The platform is designed to coordinate multiple AI agents across the entire software development lifecycle rather than serving as a single coding assistant.
Architecture and capabilities
Cosmos runs agents either in customer environments or Augment's cloud infrastructure. The system includes shared context and memory across agents, connections to development tools, and multi-model routing through what Augment calls Prism. According to the company, Prism delivers 20-30% token savings without quality degradation by selecting appropriate models for different tasks.
The platform introduces what Augment describes as "specialized agents" that store information from developer feedback. The company cites their internal testing agent "Milo" as an example, claiming it improves through conversational coaching rather than upfront context loading.
Workflow structure
Cosmos reduces the software development process to three human checkpoints, according to Augment:
- Prioritization review: Agents monitor feedback channels and propose daily priorities for human approval
- Spec and intent review: Humans review specifications before agents proceed with implementation
- Contextual understanding: A review experience focused on surfacing assumption changes rather than line-by-line code inspection
The company introduces what it calls "deep code review" — an agent-driven review process optimized for recall rather than precision, designed to catch all potential bugs when the reviewer is an AI agent rather than a human.
Availability and positioning
Cosmos is available now in public preview for MAX plan users. Pricing for the MAX plan was not disclosed. VP of Engineering Vinay Perneti wrote that the company is releasing early with "rough edges" to learn with teams experiencing what he describes as a disconnect: widespread individual agent adoption without corresponding organizational productivity gains.
The announcement frames Cosmos as infrastructure for what Augment calls "small teams of people working with large teams of agents." The company references Claude Opus 4.5's November release as a turning point when "a large portion of serious software engineers agreed it no longer makes sense to write most code by hand," though this claim is not independently verified.
What this means
Augment is betting that AI coding tools need orchestration layers rather than more powerful individual agents. The company's emphasis on model-agnostic routing addresses a real concern as frontier models become increasingly expensive for routine tasks. However, the core premise — that agents with memory and feedback loops solve organizational adoption challenges — remains unproven at scale. The public preview will test whether coordinating multiple specialized agents delivers measurable team-level productivity gains beyond individual developer efficiency.
Related Articles
Mistral releases Vibe 2.0 terminal coding agent with custom subagents and Devstral 2 API pricing
Mistral AI released Vibe 2.0, a terminal-native coding agent powered by Devstral 2, adding custom subagents, multi-choice clarifications, and slash-command skills. Devstral 2 API pricing is now $0.40/M input tokens and $2.00/M output tokens, with a smaller variant at $0.10/$0.30 per million tokens.
Google Gemini Live gains access to Memory and Connected Apps from past conversations
Google has updated Gemini Live to access past conversation history through Memory and Connected Apps. The feature, currently available in English in the US, allows the voice assistant to reference previous chats and information from YouTube, Workspace, Utilities, and image generation tools during conversations.
AWS Releases AgentCore Harness for Production AI Agents with Two-API Setup
Amazon Web Services made its AgentCore harness generally available, reducing production AI agent deployment to two API calls: CreateHarness and InvokeHarness. The managed service handles sandboxed execution, memory, tool integration, and observability, eliminating infrastructure setup for teams building LLM agents.
Mistral Rebrands Le Chat as Vibe, Launches Agentic Work and Code Modes with VS Code Extension
Mistral has rebranded Le Chat as Vibe, launching new agentic capabilities for long-running work tasks and software development. The platform now includes Work Mode for enterprise knowledge search and document synthesis, Code Mode with GitHub integration and sandboxed execution, and a new VS Code extension. Pricing starts at $14.99/month for Pro and $24.99/user/month for Team plans.
Comments
Loading...