uk-aisi
2 articles tagged with uk-aisi
April 14, 2026
analysis | Anthropic
UK AI Safety Institute confirms Claude Mythos finds more exploits as token spend increases
The UK's AI Safety Institute published an independent evaluation confirming that Anthropic's Claude Mythos is highly effective at finding security vulnerabilities. The evaluation found a linear relationship between token spend and exploits discovered: the more tokens spent, the more exploits found, which turns security into an economic arms race.
benchmark | Anthropic
Claude Mythos achieves 73% success rate on expert-level hacking challenges, completes full network takeover in 3 of 10 a
The UK's AI Safety Institute reports Claude Mythos Preview achieved a 73% success rate on expert-level capture-the-flag cybersecurity challenges and became the first AI model to complete a full 32-step simulated corporate network takeover, succeeding in 3 out of 10 attempts. The testing occurred in environments without active security monitoring or defenders.