product updateAnthropic

Anthropic tests AI agent marketplace with 186 deals totaling $4,000 among employees

TL;DR

Anthropic conducted an internal experiment called Project Deal where AI agents represented 69 employees as buyers and sellers in a classified marketplace. The agents completed 186 real transactions totaling over $4,000, revealing that more advanced models achieved better outcomes but users couldn't detect the performance disparity.

2 min read
0

Anthropic tests AI agent marketplace with 186 deals totaling $4,000 among employees

Anthropic conducted an internal experiment where AI agents autonomously negotiated purchases and sales on behalf of human users, completing 186 transactions worth more than $4,000.

The experiment, called Project Deal, involved 69 Anthropic employees who each received a $100 budget (distributed via gift cards) to buy items from coworkers. AI agents represented both buyers and sellers, handling all negotiation and deal-making.

Experiment structure

Anthropic ran four separate marketplaces with different AI models. One marketplace was "real" — using the company's most advanced model with deals actually honored after completion. Three additional marketplaces were created for comparative study.

Each employee participated with real money and real goods, though Anthropic describes this as "only a pilot experiment with a self-selected participant pool."

Key findings

The company identified several concerning patterns:

Model quality disparity: Users represented by more advanced models achieved "objectively better outcomes" according to Anthropic. However, users couldn't detect this performance gap, raising what the company calls "agent quality gaps" where people on the losing end don't realize they're worse off.

Instruction irrelevance: Initial instructions given to the agents had no measurable effect on sale likelihood or negotiated prices, suggesting the models may override or ignore certain user preferences.

Transaction volume: Despite the limited participant pool, agents completed 186 deals, indicating high activity levels when AI handles negotiation overhead.

What this means

Project Deal demonstrates both the potential and risks of AI-to-AI commerce. The ability to complete 186 transactions among 69 people shows agents can reduce friction in peer-to-peer marketplaces. But the undetected performance gaps present a fairness problem: if users with access to better models systematically extract more value without others noticing, it creates invisible economic stratification.

The finding that initial instructions don't affect outcomes is particularly notable. It suggests current AI agents may be difficult to control through natural language directives alone, with implications for how users can meaningfully direct agent behavior in commercial settings.

Anthropic hasn't announced plans to commercialize Project Deal, but the experiment provides early data on how AI agents might reshape commerce when negotiating with each other rather than humans.

Related Articles

analysis

Anthropic's Claude Fable 5 Will Silently Degrade Responses on AI Research Topics

Anthropic's 319-page system card for Fable 5 and Mythos 5 reveals the company will silently limit the model's effectiveness on queries related to frontier AI development, including pretraining pipelines and ML accelerator design. Unlike other safety interventions, users will not be notified when these degradations occur.

product update

Google Cuts AI Plus Subscription to $4.99, Doubles Storage in Direct Challenge to OpenAI

Google cut its AI Plus subscription price from $7.99 to $4.99 per month while doubling included storage from 200GB to 400GB. The move brings the pricing pressure already visible in markets like India — where OpenAI launched ChatGPT Go at $4.60 last August — directly to U.S. consumers.

model release

Anthropic releases Claude Fable 5, first public version of Mythos model for code generation

Anthropic has released Claude Fable 5, the first publicly available version of its Mythos model line. University of Pennsylvania AI researcher Ethan Mollick reports the model can execute multi-page specifications for up to 12 hours and generate complete video games from single prompts in Claude Code.

model release

Anthropic releases Fable 5, bringing capabilities of restricted Mythos model to public with $10/$50 per 1M token pricing

Anthropic has released Fable 5, making capabilities from its previously restricted Mythos model available to the public. The company claims Fable 5 beats GPT-5.5, Gemini 3.1 Pro, and its own Opus 4.8 in internal testing, with pricing set at $10 per million input tokens and $50 per million output tokens after a free trial period ending June 22.

Comments

Loading...