benchmarkAnthropic

Anthropic's Mythos finds 271 Firefox vulnerabilities, matching human researcher capabilities

TL;DR

Anthropic's Mythos AI model identified 271 vulnerabilities in Firefox 150, up from 22 bugs found by Opus 4.6 in Firefox 148. Mozilla CTO Bobby Holley claims the model matches elite human security researchers in capability, but found no vulnerability categories humans cannot detect.

2 min read
0

Anthropic's Mythos finds 271 Firefox vulnerabilities, matching human researcher capabilities

Anthropic's Mythos AI model identified 271 vulnerabilities in Firefox 150, according to Mozilla, marking a significant increase from the 22 bugs found by Anthropic's Opus 4.6 in Firefox 148.

Mozilla CTO Bobby Holley stated the results gave the Firefox team "vertigo" while acknowledging the finding represents a potential turning point for software security. "For a hardened target, just one such bug would have been red-alert in 2025, and so many at once makes you stop to wonder whether it's even possible to keep up," Holley wrote.

Model capabilities match human experts

According to Mozilla's assessment, Mythos matches the capabilities of elite security researchers. "We have many years of experience picking apart the work of the world's best security researchers, and Mythos Preview is every bit as capable," Holley wrote. "So far we've found no category or complexity of vulnerability that humans can find that this model can't."

Crucially, Holley emphasized that the model found no vulnerabilities beyond human detection capabilities: "We also haven't seen any bugs that couldn't have been found by an elite human researcher."

Implications for security economics

Holley argues the model shifts the security landscape by closing the gap between machine-discoverable and human-discoverable vulnerabilities. "Elite security researchers find bugs that fuzzers can't largely by reasoning through the source code," he explained. "This is effective, but time-consuming and bottlenecked on scarce human expertise."

The CTO disputed speculation that future AI models will discover entirely new vulnerability classes. "Software like Firefox is designed in a modular way for humans to be able to reason about its correctness. It is complex, but not arbitrarily complex," he stated. "The defects are finite, and we are entering a world where we can finally find them all."

Mozilla previously acknowledged using automated fuzzing tools for vulnerability detection, with Mythos representing an advancement in reasoning-based bug discovery that previously required human expertise.

What this means

Mythos demonstrates AI models can now perform vulnerability discovery at the level of expert security researchers, potentially democratizing access to elite-level code auditing. However, Mozilla's finding that the model discovered no bugs beyond human capability suggests current AI security tools amplify existing human methods rather than introduce fundamentally new approaches. The economic implications are significant: if vulnerability discovery becomes computationally cheap rather than requiring scarce human expertise, the attacker advantage of concentrated effort diminishes.

Related Articles

product update

NSA Using Anthropic's Unreleased Mythos Model While Pentagon Labels Company Supply Chain Risk

The National Security Agency is using Anthropic's Mythos Preview, an unreleased cybersecurity model limited to roughly 40 organizations, according to Axios. The deployment comes weeks after the Department of Defense labeled Anthropic a "supply chain risk" following the company's refusal to grant Pentagon officials unrestricted access to its models.

product update

Anthropic CEO Dario Amodei to Meet White House on Mythos Cybersecurity Model

Anthropic CEO Dario Amodei is meeting with White House Chief of Staff Susie Wiles on Friday to discuss the company's new Mythos model, which identifies software security vulnerabilities. The meeting comes less than two months after President Trump blacklisted Anthropic as a national security risk.

product update

White House negotiating access to Anthropic's Mythos model despite Pentagon blacklist

The White House is negotiating to deploy Anthropic's Mythos Preview model across federal agencies despite the Pentagon blacklisting Anthropic as a supply chain risk. Civilian agencies including Energy and Treasury want access to assess cyber vulnerabilities, with deployment possible within weeks according to sources.

model release

Anthropic releases Claude Opus 4.7 with improved coding and vision, confirms it trails unreleased Mythos model

Anthropic released Claude Opus 4.7 with improved coding capabilities, higher-resolution vision, and a new reasoning level. The company publicly acknowledged the model underperforms its unreleased Mythos system, which remains restricted due to safety concerns.

Comments

Loading...