product updateAnthropic

Anthropic adds dreaming, outcomes, and multiagent orchestration to Claude Managed Agents

TL;DR

Anthropic has released three new capabilities for Claude Managed Agents: dreaming (research preview) for pattern recognition and self-improvement, outcomes for defining success criteria with automated evaluation, and multiagent orchestration for delegating tasks to specialist agents.

2 min read
0

Anthropic adds dreaming, outcomes, and multiagent orchestration to Claude Managed Agents

Anthropic has released three new features for Claude Managed Agents, the cloud-hosted AI agent platform launched last month. The updates add self-improvement capabilities, success evaluation, and parallel task delegation.

Dreaming enables agent self-improvement

The first feature, dreaming, is released as a research preview. The system reviews past agent sessions and memory stores to extract patterns and curate memories for agent improvement over time.

According to Anthropic, dreaming works as a scheduled process that can either update agent memory automatically or allow developers to review changes before implementation. The feature integrates with the existing memory system: "Memory lets each agent capture what it learns as it works. Dreaming refines that memory between sessions, pulling shared learnings across agents and keeping it up-to-date."

Outcomes define success criteria

The outcomes feature allows developers to write rubrics describing what constitutes a successful result. A separate grader evaluates agent output against these criteria in its own context window, isolated from the agent's reasoning process.

When the grader identifies issues, it specifies required changes and the agent makes another attempt. Developers can define an outcome, run the agent, and receive webhook notifications upon completion.

Multiagent orchestration for parallel workflows

Multiagent orchestration enables a lead agent to break tasks into pieces and delegate to specialist agents, each with its own model, prompt, and tools. Specialist agents work in parallel on a shared filesystem and contribute to the lead agent's overall context.

Anthropic cites Netflix as an early adopter, which has deployed multiagent orchestration for its platform team. The company describes a use case where "a lead agent can run an investigation while subagents fan out through deploy history, error logs, metrics, and support tickets."

All agents maintain persistent event histories, allowing the lead agent to check in with specialists mid-workflow.

What this means

These updates position Claude Managed Agents as infrastructure for building sophisticated multi-agent systems without managing server infrastructure. The dreaming feature represents Anthropic's attempt at continuous learning within agents, though as a research preview, production reliability remains to be demonstrated. The Netflix deployment suggests enterprise adoption is already underway for complex operational workflows that benefit from parallel investigation across multiple data sources.

Related Articles

product update

Anthropic adds 'dreaming' feature to Claude Managed Agents for automated memory refinement

Anthropic has updated Claude Managed Agents with a feature called 'dreaming' that allows agents to automatically review past interactions and refine their memories. The feature, available in research preview, can either automatically update agent memories or let developers approve changes manually.

changelog

Anthropic doubles Claude Code usage limits for paid users, increases API capacity by up to 1500%

Anthropic has doubled Claude Code's five-hour usage limits for Pro, Max, Team, and Enterprise users while removing peak hour restrictions for Pro and Max plans. The company also increased API limits by up to 1500% for input tokens per minute through a compute capacity deal with SpaceX's Colossus 1 data center.

changelog

Anthropic doubles Claude Code rate limits, secures 220,000 Nvidia GPUs via SpaceX Colossus 1 deal

Anthropic doubled Claude Code's five-hour rate limits across Pro, Max, Team, and Enterprise plans effective Tuesday, removing peak-hours throttling for Pro and Max users. The capacity expansion comes from an exclusive agreement securing all compute at SpaceX's Colossus 1 data center, which provides over 300 megawatts and more than 220,000 Nvidia GPUs.

analysis

Anthropic's Mythos model finds tens of thousands of vulnerabilities, CEO warns of 6-12 month patching window

Anthropic CEO Dario Amodei disclosed that the company's Mythos model has uncovered tens of thousands of software vulnerabilities, including nearly 300 in Firefox alone compared to 20 found by earlier Claude models. Amodei warned of a 6-12 month window to patch these vulnerabilities before Chinese AI systems catch up in capability.

Comments

Loading...