model-refusals

1 article tagged with model-refusals

April 23, 2026
changelogAnthropic

Claude Opus 4.7 refusal rate surges to 30+ monthly complaints as Anthropic tests aggressive guardrails

Anthropic's Claude Opus 4.7 release triggered a sharp increase in false positive refusals, with developers filing 30+ complaints in April 2026 compared to 2-3 monthly reports from July-September 2025. The company deployed aggressive Acceptable Use Policy guardrails to prepare for the eventual release of its Mythos vulnerability research model.