Anthropic's Opus 4.8 matches Claude Mythos Preview in alignment, cuts thinking mode costs by 67%

TL;DR

Anthropic released Claude Opus 4.8 on May 28, 2026, replacing Opus 4.7 at unchanged pricing. The company claims the model's misalignment rates match those of Claude Mythos Preview, the experimental model deemed too dangerous for public release in April 2026. Opus 4.8 delivers faster thinking modes at one-third the cost of version 4.7.

May 28, 2026 · 9:21 PM · 2 min read

Claude Opus 4.8 — Quick Specs

Input: $5/1M tokens

Output: $25/1M tokens

Claude Opus 4.8 Reaches Mythos-Level Alignment

Anthropic released Claude Opus 4.8 on May 28, 2026, with alignment performance matching Claude Mythos Preview—the experimental model the company deemed too dangerous for public release in April 2026.

What Changed

Opus 4.8 replaces Opus 4.7 at unchanged pricing. According to Anthropic, the model delivers faster thinking modes at one-third the cost of version 4.7. The company claims 4.8 shows "substantially" lower misalignment rates than 4.7, reaching alignment levels comparable to Mythos Preview.

Anthropic states the model "reaches new highs on our measures of prosocial traits like supporting user autonomy and acting in the user's best interest," though the company did not provide specific definitions for these metrics.

Performance

Opus 4.8 improved coding performance over 4.7 on two benchmarks, though Anthropic did not specify which benchmarks or provide exact scores. The model does not surpass OpenAI's GPT-5.5 in coding tasks.

For comparison, Anthropic previously stated Opus 4.7 achieved a 92% honesty rate with reduced sycophancy and hallucinations compared to earlier versions.

Context: The Mythos Benchmark

Claude Mythos Preview emerged in April 2026 as Anthropic's most capable model to date, but the company withheld public release due to security concerns. Anthropic specifically highlighted Mythos as "strikingly capable at computer security tasks," prompting the company to establish Project Glasswing—a collaboration with Google, Nvidia, Microsoft, and Palo Alto Networks focused on defending critical software infrastructure.

That Opus 4.8's alignment now matches Mythos Preview suggests Anthropic has achieved significant safety improvements while maintaining capability levels previously considered too risky for deployment.

Pricing

Pricing remains unchanged from Opus 4.7. The announcement itself did not disclose per-token costs; the Quick Specs listing above shows $5 per 1M input tokens and $25 per 1M output tokens.
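To make the figures concrete, here is a minimal Python sketch of the arithmetic, assuming the Quick Specs rates of $5 per 1M input tokens and $25 per 1M output tokens. The function names and the sample workload are hypothetical, chosen only for illustration. The article does not say how thinking-mode usage is billed, so the second helper only converts the "one-third the cost" claim into the 67% reduction cited in the headline.

```python
# Rates from the Quick Specs listing, in US dollars per 1M tokens.
INPUT_USD_PER_MILLION = 5.00
OUTPUT_USD_PER_MILLION = 25.00


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request at the listed per-token rates."""
    input_cost = input_tokens / 1_000_000 * INPUT_USD_PER_MILLION
    output_cost = output_tokens / 1_000_000 * OUTPUT_USD_PER_MILLION
    return input_cost + output_cost


def percent_reduction(new_cost_fraction: float) -> float:
    """Convert 'costs this fraction of the old price' into a percent saved."""
    return (1 - new_cost_fraction) * 100


# Hypothetical workload: 200,000 input tokens and 40,000 output tokens.
print(f"${estimate_cost_usd(200_000, 40_000):.2f}")  # prints $2.00
print(f"{percent_reduction(1 / 3):.0f}%")            # prints 67%
```

At these rates, output tokens cost five times as much as input tokens, so the 40,000 output tokens in this example account for half of the total.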

What This Means

Anthropic is publicly shipping a model with alignment characteristics matching an experimental system it deemed unsafe for release just seven weeks earlier. This represents either a meaningful breakthrough in safety techniques or a recalibration of the company's risk tolerance. The one-third cost reduction for thinking modes addresses a key barrier to deploying reasoning-heavy models at scale. However, without disclosed benchmark scores or pricing details, independent verification of Anthropic's safety and performance claims remains impossible.

Source: zdnet.com

