
Anthropic's Opus 4.8 matches Claude Mythos Preview in alignment, cuts thinking mode costs by 67%

TL;DR

Anthropic released Claude Opus 4.8 on May 28, 2026, replacing Opus 4.7 at unchanged pricing. The company claims the model's misalignment rates match those of Claude Mythos Preview, the experimental model deemed too dangerous for public release in April 2026. Opus 4.8 delivers faster thinking modes at one-third the cost of version 4.7.


Claude Opus 4.8 Reaches Mythos-Level Alignment

Anthropic released Claude Opus 4.8 on May 28, 2026, with what the company describes as alignment performance matching Claude Mythos Preview, the experimental model it deemed too dangerous for public release in April 2026.

What Changed

Opus 4.8 replaces Opus 4.7 at unchanged pricing. According to Anthropic, the model delivers faster thinking modes at one-third the cost of version 4.7, a reduction of roughly 67%. The company claims 4.8 shows "substantially" lower misalignment rates than 4.7, reaching alignment levels comparable to Mythos Preview.

Anthropic states the model "reaches new highs on our measures of prosocial traits like supporting user autonomy and acting in the user's best interest," though the company did not provide specific definitions for these metrics.

Performance

Opus 4.8 improved coding performance over 4.7 on two benchmarks, though Anthropic did not specify which benchmarks or provide exact scores. The model does not surpass OpenAI's GPT-5.5 in coding tasks.

For comparison, Anthropic previously stated Opus 4.7 achieved a 92% honesty rate with reduced sycophancy and hallucinations compared to earlier versions.

Context: The Mythos Benchmark

Claude Mythos Preview emerged in April 2026 as Anthropic's most capable model to date, but the company withheld public release due to security concerns. Anthropic described Mythos as "strikingly capable at computer security tasks," and established Project Glasswing, a collaboration with Google, Nvidia, Microsoft, and Palo Alto Networks focused on defending critical software infrastructure.

If Opus 4.8's alignment does match Mythos Preview, as Anthropic claims, it suggests the company has achieved significant safety improvements while maintaining capability levels previously considered too risky for deployment.

Pricing

Pricing remains unchanged from Opus 4.7. Specific per-token costs were not disclosed in the announcement.

What This Means

Anthropic is publicly shipping a model with alignment characteristics matching an experimental system it deemed unsafe for release just seven weeks earlier. This represents either a meaningful breakthrough in safety techniques or a recalibration of the company's risk tolerance. Cutting thinking-mode costs to one-third of their previous level addresses a key barrier to deploying reasoning-heavy models at scale. However, without disclosed benchmark scores or pricing details, independent verification of Anthropic's safety and performance claims remains impossible.
