Claude Mythos Preview: 73% CTF Success, 32-Step Attack Solved

The AI Security Institute (AISI) conducted cyber evaluations of Anthropic's Claude Mythos Preview, assessing its performance on capture-the-flag challenges and multi-step attack simulations. The model showed significant improvement over previous frontier models in cybersecurity capabilities.

Capture-the-Flag Results

In CTF challenges where models must identify and exploit weaknesses to retrieve hidden flags, Mythos Preview achieved 73% success rate on expert-level tasks. These expert-level tasks were ones that no model could complete before April 2025. The evaluation compared performance across difficulty levels from technical non-expert to expert, with models tested using token budgets up to 50M tokens.

Cyber Range Results

AISI built "The Last Ones" (TLO), a 32-step corporate network attack simulation spanning initial reconnaissance through full network takeover, estimated to require humans 20 hours to complete. Claude Mythos Preview was the first model to solve TLO from start to finish, succeeding in 3 out of 10 attempts. Across all attempts, the model completed an average of 22 out of 32 steps.

Claude Opus 4.6 was the next best performing model, completing an average of 16 steps. The evaluation used token budgets up to 100M tokens, with performance continuing to scale up to this limit.

Limitations and Context

The model could not complete the operational technology focused cyber range 'Cooling Tower', though it got stuck on IT sections rather than OT-specific parts. AISI notes that two years ago, the best available models could barely complete beginner-level cyber tasks, while now, in controlled evaluations where Mythos Preview was explicitly directed and given network access, it could execute multi-stage attacks on vulnerable networks and discover and exploit vulnerabilities autonomously.

📖 Read the full source: HN AI Agents

AISI Evaluation Shows Claude Mythos Preview's Cyber Capabilities in CTF and Multi-Step Attacks

Capture-the-Flag Results

Cyber Range Results

Limitations and Context

👀 See Also

Open-Source Attack Surface Management Cheat Sheet Released

MCP Sandbox: Run MCP Servers in Isolated Containers Without Trusting Them

Claude Code --dangerously-skip-permissions vulnerability and open-source defense tool

AI System Discovers 12 OpenSSL Zero-Days, Curl Cancels Bug Bounty Due to AI Spam