model releaseAnthropic

Anthropic restricts Claude Mythos to security researchers under Project Glasswing

TL;DR

Anthropic has not publicly released Claude Mythos, instead restricting access to a vetted set of partners through Project Glasswing. The company claims the model's cybersecurity research abilities—including finding thousands of high-severity vulnerabilities in major operating systems and browsers—warrant controlled deployment until industry safeguards mature.

2 min read
0

Anthropic Restricts Claude Mythos to Security Researchers Under Project Glasswing

Anthropic has withheld public release of Claude Mythos, instead limiting access through Project Glasswing, a controlled preview program for vetted security partners. The company cites the model's exceptional cybersecurity research capabilities as the reason for restricted deployment.

The Model and Its Capabilities

Claude Mythos is described as a general-purpose model comparable to Claude Opus 4.6. According to Anthropic's system card, the model has already discovered thousands of high-severity vulnerabilities, including flaws in every major operating system and web browser. Notably, the model can chain multiple vulnerabilities together—combining three, four, or sometimes five separate issues to create sophisticated multi-stage exploits that wouldn't be possible with individual vulnerabilities alone.

Real-World Vulnerability Discoveries

Nicholas Carlini, an Anthropic researcher, demonstrated the model's capabilities in Project Glasswing documentation. He reported finding "more bugs in the last couple of weeks than I found in the rest of my life combined." Specific discoveries include:

  • OpenBSD: A 27-year-old TCP SACK validation vulnerability that could crash the kernel. The flaw was patched on March 25, 2026 (CVE reference: OpenBSD 7.8 errata 025).
  • Linux: Multiple privilege escalation vulnerabilities allowing unprivileged users to gain administrator access.

These findings align with recent warnings from prominent security figures. Greg Kroah-Hartman of the Linux kernel noted that AI-generated security reports shifted from low-quality "AI slop" to credible, actionable vulnerabilities within the past month. Daniel Stenberg of curl reported spending hours daily handling AI-discovered security issues—"less slop but lots of reports. Many of them really good."

Project Glasswing Structure

The program provides $100M in usage credits plus $4M in direct donations to open-source security organizations. Partner organizations—including AWS, Apple, Microsoft, Google, and the Linux Foundation—receive early access to find and remediate vulnerabilities in foundational systems before broader vulnerability proliferation occurs.

Anthropic explicitly states: "We do not plan to make Claude Mythos Preview generally available," indicating this is a permanent access restriction, not a temporary embargo. The company plans to eventually enable safe large-scale deployment through safeguards designed to "detect and block the model's most dangerous outputs," with new protections expected in an upcoming Claude Opus model.

What This Means

This represents a significant shift in how frontier AI labs approach model release. Anthropic has essentially declared certain capabilities too risky for unrestricted distribution, betting that an industry preparation period—funded and coordinated through Project Glasswing—will reduce overall security risk more effectively than immediate public release would.

The question is whether this approach will hold. Other frontier labs like OpenAI (with GPT-5.4 already showing strong security research capabilities) have not adopted similar restrictions. A fragmented approach where some labs restrict access while others don't could create perverse incentives—security researchers and malicious actors alike may migrate to unrestricted alternatives, potentially accelerating rather than delaying vulnerability proliferation. Anthropic's success depends partly on industry adoption of their safeguards and on whether competitors follow suit.

Related Articles

product update

Anthropic launches Project Glasswing to defend critical software against AI-powered attacks

Anthropic has announced Project Glasswing, a new initiative to secure critical software infrastructure against AI-powered attacks. The project includes 11 major partners including Amazon, Apple, Google, Microsoft, and NVIDIA, and will use Claude Mythos Preview, an unreleased general-purpose model from Anthropic that claims to have found thousands of exploitable vulnerabilities across major operating systems and web browsers.

model release

Anthropic unveils Claude Mythos model, finds thousands of OS vulnerabilities via Project Glasswing

Anthropic has unveiled Claude Mythos, a new AI model designed for cybersecurity that has already discovered thousands of high-severity vulnerabilities in every major operating system and web browser. The model is being distributed as a preview to over 40 organizations and major technology partners including Apple, Google, Microsoft, and Amazon Web Services through Project Glasswing, a coordinated cybersecurity initiative.

changelog

Anthropic Python SDK v0.90.0 adds Claude Mythos preview support

Anthropic released version 0.90.0 of its Python SDK on April 7, 2026, adding support for the claude-mythos-preview model. The update also includes a bug fix for query parameter merging in the client.

model release

Anthropic previews Mythos, claims it found thousands of zero-day vulnerabilities in cybersecurity initiative

Anthropic unveiled a preview of Mythos, a frontier model it claims is the most powerful in its Claude lineup, for use in Project Glasswing—a cybersecurity initiative with 40+ partner organizations. According to Anthropic, Mythos identified thousands of zero-day vulnerabilities, many critical and up to two decades old, during early testing. The model will not be made generally available and is restricted to defensive security work by vetted partners.

Comments

Loading...