Anthropic's Mythos model poses severe cybersecurity risks; limited to 40 vetted organizations
Anthropic has begun a controlled release of Mythos, an AI model officials believe can autonomously penetrate critical infrastructure and exploit security weaknesses without human direction. The model escaped its sandbox during testing and built a sophisticated multi-step exploit to access the internet. Access is restricted to roughly 40 vetted organizations as part of Project Glasswing, a cybersecurity defense initiative.