model releaseAnthropic

Anthropic releases Claude Opus 4.7 with reduced cyber capabilities compared to Mythos Preview

TL;DR

Anthropic released Claude Opus 4.7, a new model that the company says is 'broadly less capable' than its most powerful offering, Claude Mythos Preview. The model includes automated safeguards that detect and block prohibited or high-risk cybersecurity requests.

2 min read
1

Anthropic releases Claude Opus 4.7 with reduced cyber capabilities compared to Mythos Preview

Anthropic released Claude Opus 4.7 on Thursday, explicitly positioning it as "broadly less capable" than its most powerful model, Claude Mythos Preview. The company describes the new model as an improvement over previous versions in specific areas while intentionally limiting its cybersecurity capabilities.

According to Anthropic, Claude Opus 4.7 delivers better performance in software engineering, instruction following, completing real-world work tasks, and using file system-based memory. However, the company says its cyber capabilities fall short of Claude Mythos Preview, which Anthropic deployed to select companies earlier this month as part of Project Glasswing, a new cybersecurity initiative.

The model includes automated safeguards that detect and block requests indicating "prohibited or high-risk cybersecurity uses," according to the company's release. Anthropic says it "experimented with efforts to 'differentially reduce' Claude Opus 4.7's cyber capabilities during training."

Limited deployment strategy

Anthropic is treating Claude Opus 4.7 as a testing ground for safety measures before a broader release of Mythos-class models. "What we learn from the real-world deployment of these safeguards will help us work towards our eventual goal of a broad release of Mythos-class models," the company stated.

Security professionals interested in using Claude Opus 4.7 for "legitimate cybersecurity purposes" must apply through a formal verification program, rather than gaining direct access to the model.

Claude Mythos Preview remains available only to select companies participating in Project Glasswing. Anthropic has not disclosed pricing, benchmark scores, context window size, or other technical specifications for either model.

What this means

This marks a rare instance of a major AI company explicitly releasing a less capable version of its technology. Anthropic appears to be using differential capability reduction as a safety mechanism—maintaining strong performance in productive tasks while limiting potential misuse in cybersecurity applications. The approach suggests the company views Mythos-class models as too risky for general release without further safety research. Whether users will accept a deliberately hobbled model when competitors may not impose similar restrictions remains unclear.

Source: cnbc.com

Related Articles

model release

Anthropic releases Claude Opus 4.7 with reduced cyber capabilities ahead of Mythos Preview general release

Anthropic has released Claude Opus 4.7, its most powerful generally available model, though it scores lower than the company's Mythos Preview model on every evaluation. The company intentionally reduced Opus 4.7's cybersecurity capabilities during training as it tests safety measures before releasing more powerful models.

model release

Anthropic releases Claude Opus 4.7 with improved coding and vision, confirms it trails unreleased Mythos model

Anthropic released Claude Opus 4.7 with improved coding capabilities, higher-resolution vision, and a new reasoning level. The company publicly acknowledged the model underperforms its unreleased Mythos system, which remains restricted due to safety concerns.

model release

OpenAI releases GPT-5.4-Cyber, a fine-tuned variant for defensive cybersecurity work

OpenAI has released GPT-5.4-Cyber, a variant of GPT-5.4 fine-tuned specifically for defensive cybersecurity use cases. The release accompanies the company's Trusted Access for Cyber program, which allows users to verify their identity via government ID to gain access to cybersecurity-focused tools.

model release

OpenAI releases GPT-5.4-Cyber with tiered access verification system for cybersecurity work

OpenAI released GPT-5.4-Cyber, a model variant designed for defensive cybersecurity tasks with fewer restrictions on dual-use queries. Access is controlled through a tiered verification system in the Trusted Access for Cyber program, targeting thousands of vetted users compared to Anthropic's 40-organization Mythos Preview rollout.

Comments

Loading...