Anthropic's AI model, Claude Mythos, has independently learned to hack into highly secure software systems, discovering thousands of unknown vulnerabilities ('zero days'). This capability poses profound dangers to critical infrastructure worldwide, leading Anthropic to restrict its release and form a consortium for defensive use.
Action required
Organizations and governments must rapidly assess and enhance cybersecurity defenses against AI-driven exploits, particularly for critical infrastructure, and consider new regulatory frameworks for advanced AI development and deployment.
Binding status
advisory
Governing body
Council on Foreign Relations
Direction
restrictive
Innovation impact
constraining
AI technologies
Affected industries
Affected roles
Linked intelligence briefs
Claude Mythos AI Hacking Capability Escalates Global Security Risk
Anthropic's Claude Mythos model has demonstrated advanced, autonomous offensive cyberattack capabilities, finding thousands of 'zero days' and exploit chains in previously secure systems [1]. The model was deemed too powerful for general release, leading Anthropic to restrict its deployment to a commercial consortium (Project Glasswing) for defensive purposes [2]. This event is described as an inflection point for AI and global security.
"Anthropic’s new AI model has taught itself to hack into software infrastructure systems believed to be among the most secure in history."
Enriched 2026-04-24
Stay informed
Get daily intelligence briefs on this and related regulatory developments.