Anthropic's Mythos Found Hundreds of Bugs at $20K Each

Anthropic's Claude Mythos model cost $20,000 to find a single 27-year-old OpenBSD bug, and the entire Project Glasswing burned through a $100 million token budget. That's not a revolution - it's an expensive scaling exercise that most attackers can't afford.

Why the Panic Over Mythos Misses the Real Story

The UK AI Security Institute called Mythos the first model to complete "The Last One" cyber range - a full attack chain from reconnaissance to network takeover. GPT-5.4 and Opus 4.6 weren't far behind on the same benchmarks. Those benchmarks also lack active defenders and penalty for noise, so a real SOC would catch most Mythos-driven attacks before they pivoted.

Mythos's main marketing trick was highlighting vulnerabilities old enough to drive - like that 16-year-old FFmpeg bug. Old bugs are valuable because they affect more versions, but they're not harder to find. They just needed someone to look. Mythos looked, repeatedly: a thousand runs through its scaffold to snag that OpenBSD bug, at $20,000 a pop.

Open Models Are Catching Up Quickly

Without access to Mythos or Opus, DeepSeek runs decent in the cloud, while Gemma 4 and Qwen 3.6 punch above their weight in self-hosted setups - finding about half the vulnerabilities Mythos spotted in the benchmark. Aisle claimed the secret is all in the harness, not the model. That's half true: smaller models can detect vulnerabilities, but they can't produce valid exploits. Mythos-class models prove exploitability, which solves the false positive plague that killed earlier AI bug hunting.

Mozilla reported 271 findings with an extremely low false positive rate. Cloudflare said the false positive rate was "better than human testers." Those claims need wider verification, but if they hold, Mythos's real advantage is reducing noise - not finding bugs that others can't.

Where This Leaves the Cybersecurity Industry

Mythos brings new risks, but only for actors who already had advanced cyber resources. Not for the average script kiddie. The US government blocking Fable (Mythos's safeguard-heavy cousin) gives OpenAI time to catch up. The next interesting question: how fast can open models close the exploit gap? Because if Gemma 4 learns to weaponize its findings at DeepSeek's compute cost, the Mythos era might be shorter than Anthropic's PR campaign suggests.

Source: Post-Mythos Cybersecurity: Keep calm and carry on
Domain: cephalosec.com

Anthropic's Mythos Found Hundreds of Bugs at $20K Each

Why the Panic Over Mythos Misses the Real Story

Open Models Are Catching Up Quickly

Where This Leaves the Cybersecurity Industry

More in Cybersecurity