changelogAnthropic

Anthropic Launches Claude Opus 4.8 Fast Mode at 2x Price for Higher Output Speed

TL;DR

Anthropic has released a fast-mode variant of Claude Opus 4.8 that delivers higher output speed at double the pricing of the standard version. The model offers identical capabilities to regular Opus 4.8 with input pricing at $10 per million tokens and output at $50 per million tokens.

1 min read
0

Claude Opus 4.8 (Fast) — Quick Specs

Context window1000K tokens
Input$10/1M tokens
Output$50/1M tokens

Anthropic Launches Claude Opus 4.8 Fast Mode at 2x Price for Higher Output Speed

Anthropic has released a fast-mode variant of Claude Opus 4.8 that delivers higher output speed at double the pricing of the standard version, according to the model's listing on OpenRouter.

Pricing and Specifications

The fast-mode variant is priced at $10 per million input tokens and $50 per million output tokens—exactly twice the cost of regular Claude Opus 4.8. The model maintains a 1 million token context window, matching the standard version.

The release date is listed as May 27, 2026, though this appears to be placeholder data from OpenRouter's system.

Technical Details

According to Anthropic's documentation, the fast-mode variant offers identical capabilities to the standard Opus 4.8 model. The key differentiator is output speed: the fast variant generates tokens more quickly, making it suitable for applications where response latency matters more than cost efficiency.

The model routes through OpenRouter's provider network, which automatically selects the best available provider based on prompt size and parameters, with fallback options to maximize uptime.

What This Means

The fast-mode variant represents a straightforward trade-off: users pay double to get faster token generation without sacrificing model capabilities. This pricing structure follows a common pattern in AI APIs where premium tiers offer speed improvements for latency-sensitive applications like real-time chat interfaces or customer support systems. The 2x price multiplier for speed is lower than some competitors charge for similar performance tiers, though direct comparisons require benchmarking actual throughput numbers, which Anthropic has not yet disclosed.

Related Articles

model release

Anthropic releases Claude Opus 4.8 with improved agentic coding and reasoning benchmarks

Anthropic released Claude Opus 4.8 on May 28, 2026, with improved performance in agentic coding, computer use, and reasoning benchmarks. Pricing remains at $5 per million input tokens and $25 per million output tokens, while the model's fast mode is now three times cheaper than previous versions.

model release

Anthropic releases Claude Opus 4.8 with 69.2% agentic coding score, 2.5x faster performance

Anthropic released Claude Opus 4.8 on May 28, 2026, six weeks after version 4.7. The model achieves 69.2% on agentic coding benchmarks (up from 64.3%), runs 2.5 times faster in fast mode at one-third the cost, while maintaining the same pricing as version 4.7.

model release

Anthropic's Opus 4.8 matches Claude Mythos Preview in alignment, cuts thinking mode costs by 67%

Anthropic released Claude Opus 4.8 on May 28, 2026, replacing Opus 4.7 at unchanged pricing. The company claims the model's misalignment rates match those of Claude Mythos Preview, the experimental model deemed too dangerous for public release in April 2026. Opus 4.8 delivers faster thinking modes at one-third the cost of version 4.7.

model release

Anthropic's Claude Opus 4.8 launches on AWS Bedrock in four regions

Anthropic's Claude Opus 4.8 is now available on Amazon Bedrock and Claude Platform on AWS. The model is designed for autonomous multi-stage tasks, agentic coding, and long-running workflows with reduced supervision.

Comments

Loading...