changelogAnthropic

Anthropic reverts three system changes that degraded Claude Code performance in March and April

TL;DR

Anthropic confirmed three separate system changes in March and April degraded Claude Code, Claude Agent SDK, and Claude Cowork performance. The company reduced default reasoning effort from high to medium on March 4, introduced a caching bug on March 26 that cleared session data with every turn, and added restrictive word limits on April 16 that caused a 3% performance drop.

2 min read
0

Anthropic Reverts Three System Changes That Degraded Claude Code Performance

Anthropic confirmed that Claude Code users who reported declining quality in March and April were correct. The company identified three distinct system changes that degraded performance across Claude Code, Claude Agent SDK, and Claude Cowork. The Claude API was not affected.

Three Separate Issues

Reasoning effort reduction (March 4): Anthropic changed Claude Code's default reasoning effort level from high to medium to reduce latency from extended thinking periods. According to the company, "This was the wrong tradeoff." The change was reverted April 7. The current Claude Code build v2.1.118 now defaults to "xhigh" reasoning effort on Sonnet 4.6.

Caching bug (March 26): Engineers introduced a bug while attempting to clear old output tokens for idle users after one hour. Instead of clearing cached thinking sessions only after idle periods, the bug cleared cached session data with every prompt-response turn. This made Claude "forgetful and repetitive." The issue was fixed April 10 for Sonnet 4.6 and Opus 4.6.

Response length limits (April 16): Anthropic added new system prompt instructions to reduce verbosity: "Length limits: keep text between tool calls to ≤25 words. Keep final responses to ≤100 words unless the task requires more detail." Internal testing before deployment suggested the change was safe, but post-deployment ablation tests revealed a 3% performance drop for both Opus 4.6 and 4.7. The system prompt change was reverted April 20.

Company Response

Anthropic emphasized it did not intentionally degrade its models. The company is implementing additional measures including more internal testing for Claude Code builds, improvements to its Code Review tool, better evaluation of system prompt changes, and a new @ClaudeDevs X account for detailed product communications.

The company reset usage levels for all customers following the issues. Head of growth Amol Avasare separately committed to more direct communication after previously addressing an unannounced A/B test through social media.

What This Means

This incident exposes the fragility of complex AI systems where seemingly minor adjustments cascade into measurable quality degradation. The 3% performance drop from word count restrictions reveals how difficult it is to predict the impact of system prompt changes, even with internal testing. More concerning is that Anthropic's initial testing failed to catch the caching bug and the performance impact of length restrictions—suggesting their pre-deployment evaluation framework needs strengthening. The month-long window between introducing issues and fully reverting them indicates a gap in real-time monitoring systems.

Related Articles

product update

Anthropic adds 15 lifestyle app integrations to Claude, including Spotify, Instacart, and Uber

Anthropic has expanded Claude's integration directory to include 15 lifestyle services including Spotify, Instacart, AllTrails, Uber, and Booking.com. The update shifts Claude's third-party connectivity from professional and educational tools to personal use cases, with apps now appearing dynamically within conversations.

product update

Anthropic adds personal app connectors to Claude for Spotify, Uber Eats, TurboTax

Anthropic has released app connectors allowing Claude to integrate directly with personal services including Spotify, Uber, Uber Eats, Instacart, TurboTax, Audible, AllTrails, and TripAdvisor. The connectors are available now across all Claude plans, with mobile support in beta.

product update

Claude adds Spotify integration with playlist generation, audio playback, and library management

Anthropic's Claude now integrates with Spotify's streaming service, allowing users to connect their accounts for personalized music and podcast recommendations. The integration includes audio preview, library saving, and playlist generation for Spotify Premium subscribers.

analysis

Mozilla finds 271 vulnerabilities in Firefox 150 using Anthropic's Claude Mythos Preview

Mozilla's Firefox engineering team identified 271 vulnerabilities for version 150 using Anthropic's Claude Mythos Preview, following a prior collaboration that yielded 22 security-sensitive fixes in version 148 using Opus 4.6. The findings demonstrate that AI models can now match elite human security researchers at discovering code vulnerabilities.

Comments

Loading...