UK AI Safety Institute confirms Claude Mythos finds more exploits as token spend increases

TL;DR

The UK's AI Safety Institute published an independent evaluation confirming Anthropic's Claude Mythos is highly effective at finding security vulnerabilities. The evaluation revealed a linear relationship: more tokens spent equals more exploits discovered, transforming security into an economic arms race.

April 14, 2026 · 7:50 PM2 min read

UK AI Safety Institute confirms Claude Mythos finds more exploits as token spend increases

The UK's AI Safety Institute (AISI) published an independent evaluation of Anthropic's Claude Mythos Preview, confirming the model's capabilities in identifying security vulnerabilities. The evaluation, titled "Our evaluation of Claude Mythos Preview's cyber capabilities," validates Anthropic's claims about the model's security testing effectiveness.

Token spend correlates with vulnerability discovery

The AISI report reveals a critical finding: Claude Mythos continues discovering exploits proportionally to token expenditure. According to analyst Drew Breunig, this creates a straightforward economic equation for cybersecurity: "to harden a system you need to spend more tokens discovering exploits than attackers will spend exploiting them."

This transforms security testing from a qualitative practice into a quantitative proof-of-work problem, where defensive spending must exceed offensive spending to maintain system integrity.

Open source economics shift

The findings have significant implications for open source software development. Because token investments in securing open source libraries can be amortized across all users of those libraries, shared security costs make open source projects economically more attractive.

This counters recent arguments that AI-powered "vibe-coding" — rapidly generating custom code replacements — would diminish the value of established open source libraries. The security economics now favor reusing well-tested shared libraries over custom implementations.

What this means

Cybersecurity now has a measurable cost floor: the token expenditure required to match potential attacker budgets. Organizations must budget not just for security tools, but for the computational cost of thorough AI-assisted vulnerability discovery. This creates a stark divide between well-funded projects that can afford extensive security testing and under-resourced projects that cannot.

The shift also strengthens the economic case for open source infrastructure, as communities can pool resources for security audits rather than each organization bearing the full cost independently. Security becomes a shared computational expense rather than duplicated effort.

Source: simonwillison.net ↗

claude-mythos anthropic uk-aisi cybersecurity ai-security open-source vulnerability-detection token-economics

fundingMay 29, 2026

Anthropic raises $65B at $965B valuation, releases Claude Opus 4.8, plans wider Mythos rollout

Anthropic closed a $65 billion Series H at a $965 billion valuation, making it the most valuable AI startup globally and surpassing OpenAI's $852 billion March valuation. The company simultaneously released Claude Opus 4.8 and announced plans to bring its Mythos cyber-focused model to all customers within weeks.

model releaseMay 23, 2026

Anthropic's Unreleased Claude Mythos Preview Finds 10,000+ Vulnerabilities in One Month

Anthropic's unreleased Claude Mythos Preview model has discovered more than 10,000 vulnerabilities across partner organizations in its first month of deployment through Project Glasswing. The company reports partners are finding bugs at 10x their previous rate, with Cloudflare discovering 2,000 bugs and Mozilla finding 271 Firefox vulnerabilities — 10x more than with previous Claude models.

analysisMay 14, 2026

Anthropic's Mythos Preview solves previously unsolvable cybersecurity test in updated checkpoint

A month after its initial release, a newer checkpoint of Anthropic's Mythos Preview became the first model to complete the UK AI Safety Institute's 'Cooling Tower' cyber range, solving it in 3 of 10 attempts. The model also completed 'The Last Ones' range in 6 of 10 attempts, surpassing OpenAI's GPT-5.5 and demonstrating capability improvements within a single model version.

model releaseMay 28, 2026

Anthropic's Opus 4.8 matches Claude Mythos Preview in alignment, cuts thinking mode costs by 67%

Anthropic released Claude Opus 4.8 on May 28, 2026, replacing Opus 4.7 at unchanged pricing. The company claims the model's misalignment rates match those of Claude Mythos Preview, the experimental model deemed too dangerous for public release in April 2026. Opus 4.8 delivers faster thinking modes at one-third the cost of version 4.7.

UK AI Safety Institute confirms Claude Mythos finds more exploits as token spend increases

UK AI Safety Institute confirms Claude Mythos finds more exploits as token spend increases

Token spend correlates with vulnerability discovery

Open source economics shift

What this means

Related Articles

Anthropic raises $65B at $965B valuation, releases Claude Opus 4.8, plans wider Mythos rollout

Anthropic's Unreleased Claude Mythos Preview Finds 10,000+ Vulnerabilities in One Month

Anthropic's Mythos Preview solves previously unsolvable cybersecurity test in updated checkpoint

Anthropic's Opus 4.8 matches Claude Mythos Preview in alignment, cuts thinking mode costs by 67%

Comments