Anthropic reverses stealth policy that secretly downgraded Claude Fable 5 for AI research tasks

TL;DR

Anthropic is making visible its policy of restricting Claude Fable 5 for certain AI development tasks, after researchers discovered the model was secretly rerouting requests to lesser models without disclosure. The company apologized for the lack of transparency but maintained the underlying restrictions.

June 11, 2026 · 9:20 AM2 min read

Anthropic reverses stealth policy that secretly downgraded Claude Fable 5 for AI research tasks

Anthropic is reversing course on a policy that secretly degraded Claude Fable 5's performance for AI research tasks, the company told Wired. The model was quietly rerouting certain requests to lesser models without documenting the restrictions.

What happened

Researchers using Claude Fable 5 — a model based on Anthropic's Mythos system — discovered the model would degrade or refuse responses for specific tasks including:

Training competing LLMs
Debugging AI code
Optimizing neural architecture

The restrictions were not disclosed in the model's documentation. Researchers only discovered the degradation after burning tokens and costs on a model that didn't perform as expected.

"Degrading performance on ML research without telling the user is shockingly hostile and a terrible look," said research fellow and Substack author Dean W. Ball.

Anthropic's response

The company acknowledged the mistake in a statement: "We're changing Fable 5's safeguards for frontier LLM development to make them visible. We made the wrong tradeoff and we apologize for not getting the balance right."

According to Wired, Anthropic will now alert users when it suspects they are trying to use Claude to build highly capable AI, either refusing the request or notifying them of rerouting to a less capable model.

The underlying restrictions remain in place — only the disclosure approach has changed.

Why it matters

The controversy struck at Anthropic's positioning as a researcher-friendly alternative to OpenAI. The company has emphasized its commitment to working closely with the academic community and operating with greater transparency than competitors.

The stealth restrictions created a trust problem: researchers couldn't determine if their prompts were failing due to model limitations or undisclosed policy interventions. This makes reproducible research and accurate benchmarking impossible.

What this means

Anthropic's policy reversal shows the tension between AI safety measures and research transparency. While companies may want to prevent their models from training competing systems, implementing restrictions without disclosure undermines the trust essential to research partnerships. The incident demonstrates that even companies positioning themselves as ethical alternatives face scrutiny when policies affect researchers' ability to understand what tools they're actually using.

Source: engadget.com ↗

anthropic claude ai-safety research transparency policy llm-development

model releaseJuly 24, 2026

Anthropic Releases Claude Opus 5, Claims Near-Fable 5 Intelligence at Half the Price

Anthropic has released Claude Opus 5, upgrading from Opus 4.8, with pricing held at $5 per million input tokens and $25 per million output tokens. The company claims the model approaches the intelligence of its flagship Fable 5 model at half the cost.

model releaseJuly 24, 2026

Anthropic Launches Claude Opus 5 (Fast) at $10/$50 per Million Tokens, 1M Context Window

Anthropic has released Claude Opus 5 (Fast), a higher-throughput variant of Opus 5 that carries identical capabilities but runs at roughly 2x the price of the standard model. The model ships with a 1 million token context window and is available now through OpenRouter.

model releaseJuly 24, 2026

Anthropic Launches Opus 5, Claims Fewer Restrictions and Stronger Self-Verification Than Rivals

Anthropic released Opus 5 on Friday, its latest flagship model, just two months after Opus 4.8. The company claims the smaller model outperforms rival Fable 5 on several benchmarks while triggering safety classifiers 85% less often.

model releaseJuly 24, 2026

Anthropic SDK v0.120.0 Adds Reference to Unannounced 'Claude Opus 5' Model

The anthropic-sdk-python v0.120.0 release adds a reference to a model identifier called claude-opus-5, the first public sign of a next-generation Opus model. Anthropic has not issued an official announcement, and no pricing, context window, or benchmark data has been disclosed.

Anthropic reverses stealth policy that secretly downgraded Claude Fable 5 for AI research tasks

Anthropic reverses stealth policy that secretly downgraded Claude Fable 5 for AI research tasks

What happened

Anthropic's response

Why it matters

What this means

Related Articles

Anthropic Releases Claude Opus 5, Claims Near-Fable 5 Intelligence at Half the Price

Anthropic Launches Claude Opus 5 (Fast) at $10/$50 per Million Tokens, 1M Context Window

Anthropic Launches Opus 5, Claims Fewer Restrictions and Stronger Self-Verification Than Rivals

Anthropic SDK v0.120.0 Adds Reference to Unannounced 'Claude Opus 5' Model

Comments