analysis

Mozilla finds 423 Firefox security bugs in one month using Claude Mythos preview

TL;DR

Mozilla found 423 security bugs in Firefox during April 2026 using early access to Anthropic's Claude Mythos preview model — a 14x increase from their 20-30 monthly baseline. The company credits both improved model capabilities and refined techniques for filtering AI-generated findings.

2 min read
0

Mozilla finds 423 Firefox security bugs in one month using Claude Mythos preview

Mozilla identified 423 security vulnerabilities in Firefox during April 2026 using early access to Anthropic's Claude Mythos preview model, according to a detailed technical post published by the organization. The company's baseline was 20-30 security bug fixes per month throughout 2025.

The dramatic increase represents a 14x jump from Mozilla's typical monthly rate. February 2026 showed 61 fixes, March showed 76, before the April spike to 423.

From noise to signal

Mozilla attributes the results to two factors: improved model capabilities and refined harness techniques for steering and filtering model outputs. The company specifically addressed the recent problem of AI-generated security reports being "unwanted slop" that imposed asymmetric costs on maintainers.

"It is difficult to overstate how much this dynamic changed for us over a few short months," Mozilla wrote. "This was due to a combination of two main factors. First, the models got a lot more capable. Second, we dramatically improved our techniques for harnessing these models — steering them, scaling them, and stacking them to generate large amounts of signal and filter out the noise."

The bugs found included a 20-year-old XSLT vulnerability and a 15-year-old bug in the <legend> HTML element. Many attempted exploits identified by the AI harness were blocked by Firefox's existing defense-in-depth security measures.

Mythos model details unknown

Anthropic has not publicly announced Claude Mythos or disclosed its specifications. Mozilla had early access to the preview version, but context window size, pricing, benchmark scores, and release timeline remain undisclosed.

The finding process involved what Mozilla describes as "stacking" multiple models and implementing filtering systems to separate legitimate vulnerabilities from false positives — a critical requirement given the historical problem of low-quality AI-generated security reports overwhelming open source maintainers.

What this means

This represents the first documented case of an AI model materially accelerating security research at browser-scale. The 14x increase in legitimate vulnerability discoveries suggests newer models combined with proper filtering infrastructure can shift the economics of security auditing. However, Mozilla's success depended on early access to an unreleased model and significant engineering work to build filtering systems — resources not available to most open source projects. The case also validates concerns about AI-assisted vulnerability discovery: if Mozilla found 423 bugs this quickly, adversaries with similar access could potentially do the same.

Related Articles

analysis

US export controls force Anthropic to take Claude Fable 5 offline indefinitely

The US government imposed export controls on Anthropic's newly released Claude Fable 5 and underlying Mythos models on Friday, restricting access even for foreign nationals working at Anthropic in the United States. Anthropic took both models completely offline rather than risk non-compliance, leaving Fable unavailable to all users as of this writing.

analysis

Anthropic reverses course on invisible Claude Fable distillation guardrails after researcher backlash

Anthropic is making its anti-distillation safeguards visible in Claude Fable 5 after backlash over silently degrading responses when it detected attempts to use the model for training competing systems. Queries suspected of distillation will now be routed to Claude Opus 4.8 with explicit user notification, matching how the company handles other high-risk areas.

analysis

Anthropic reverses stealth policy that secretly downgraded Claude Fable 5 for AI research tasks

Anthropic is making visible its policy of restricting Claude Fable 5 for certain AI development tasks, after researchers discovered the model was secretly rerouting requests to lesser models without disclosure. The company apologized for the lack of transparency but maintained the underlying restrictions.

analysis

Anthropic's Claude Fable 5 Will Silently Degrade Responses on AI Research Topics

Anthropic's 319-page system card for Fable 5 and Mythos 5 reveals the company will silently limit the model's effectiveness on queries related to frontier AI development, including pretraining pipelines and ML accelerator design. Unlike other safety interventions, users will not be notified when these degradations occur.

Comments

Loading...