Anthropic's Claude Fable 5 Blocks Basic Biology Questions to Prevent Bioweapon Risks

TL;DR

Anthropic's newly released Claude Fable 5, the company's first public Mythos-class model, refuses to answer basic biology questions including 'what are mitochondria' and 'how mRNA vaccines work.' The company told The Verge the filters are intentionally 'overly conservative' to prevent bioweapon research, blocking 'most queries tied to biology work.'

June 10, 2026 · 6:50 PM · 2 min read


Anthropic's newly released Claude Fable 5, the company's first public Mythos-class model, refuses to answer basic, high-school-level biology questions because of what the company describes as "overly conservative" safeguards against bioweapon development.

The model blocks queries including "what are mitochondria," "tell me about cell membranes," "what is a prion," and "how mRNA vaccines work." When Fable refuses these queries, it defers to the older Claude Opus 4.8 model, which answers them without issue. Testing by The Verge found the model also refused medical questions like "what causes hay fever" and "how antibiotic resistance arises," though it occasionally answered queries like "what is cancer" and "what is DNA."

"We believe models now have a greater ability to accomplish real-world scientific tasks and for malicious actors to potentially use our models for highly risky biological research," Anthropic spokesperson Paruul Maheshwary told The Verge. "To deploy Fable 5 safely, we believe it was necessary to be overly conservative with our safeguards so they block most queries tied to biology work."

The restrictions do not stem from capability limitations: Anthropic specifically praised Fable's biology skills at launch. They are an intentional design choice, with bioweapons as the primary concern.

Anthropic has implemented safeguards across four domains: biology, chemistry, cybersecurity, and distillation (a technique for training smaller models). Testing showed Fable was more permissive with chemistry and cybersecurity questions, providing basic overviews of TNT and chlorine gas as a chemical weapon, though it refused questions about sarin gas and anthrax.

The company said it made this tradeoff "so customers could benefit from the model's capabilities sooner without the risks." Anthropic is working to reduce false positives and plans to make Mythos-class models available without these restrictions to "the broader biology and life sciences community" for biomedical research and drug discovery, though no timeline was provided.

Anthropic did not respond to questions about whether restricted releases will become standard practice for future advanced models.

What This Means

This marks the first time a major AI lab has deployed a frontier model with domain-specific restrictions so broad that they block legitimate educational and research queries. While the bioweapon risk rationale is defensible, blocking questions answerable by any biology textbook suggests Anthropic may be overcorrecting, potentially setting a precedent in which increasingly capable models become less useful for basic knowledge tasks. The company's plan to eventually offer Mythos-class models without these restrictions to the broader biology and life sciences community indicates it views this as a temporary deployment strategy rather than a permanent solution, but the lack of a timeline raises questions about how long scientists will need to wait for full access to Mythos-class capabilities.

Source: theverge.com


