Claude Mythos & Project Glasswing: When AI Gets Too Good at Hacking, It Becomes the Defenders' Weapon

    Claude Mythos & Project Glasswing: When AI Gets Too Good at Hacking, It Becomes the Defenders' Weapon

    11. April 20264 min read
    Till Freitag

    TL;DR: „Claude Mythos Preview finds zero-day vulnerabilities in every major operating system and browser – including bugs that went undetected for 27 years. Anthropic isn't releasing it publicly but deploying it defensively through Project Glasswing with 12 founding partners including AWS, Google, Microsoft, and Apple."

    — Till Freitag

    A Model Too Dangerous for Public Release

    On April 7, 2026, Anthropic did something unusual: announced a new frontier model – and simultaneously declared it would not be publicly available.

    Claude Mythos Preview is a general-purpose model that demonstrates one capability changing everything: it can find and exploit software vulnerabilities – better than virtually any human expert.

    This isn't a marketing claim. Mythos Preview has already found thousands of zero-day vulnerabilities – including critical bugs in every major operating system and every major web browser.

    What Mythos Preview Found

    Three examples illustrate the scale:

    1. A 27-year-old vulnerability in OpenBSD – an operating system known for its security. The bug allowed anyone to remotely crash any machine by simply connecting to it.

    2. A 16-year-old bug in FFmpeg – software used in countless applications for video encoding. Automated testing tools had hit this line of code five million times without catching the flaw.

    3. A Linux kernel exploit chain – the model autonomously found and chained multiple vulnerabilities to escalate from ordinary user access to complete machine control.

    The remarkable part: Mythos Preview found most of these vulnerabilities entirely autonomously – without any human steering.

    The Leap Over Opus 4.6

    The numbers are dramatic. On the CyberGym benchmark, Mythos Preview scores 83.1% – compared to 66.6% for Opus 4.6.

    Even more striking is the exploit comparison: in a Firefox JavaScript engine test, Opus 4.6 could develop a working exploit in only 2 out of several hundred attempts. Mythos Preview succeeded 181 times.

    General coding benchmarks tell the same story:

    • SWE-bench Verified: 93.9% (vs. 80.8%)
    • SWE-bench Pro: 77.8% (vs. 53.4%)
    • Terminal-Bench 2.0: 82.0% (vs. 65.4%)

    These capabilities weren't explicitly trained – they emerged as a side effect of improved code, reasoning, and autonomy capabilities.

    Project Glasswing: The Defense Initiative

    Instead of making Mythos Preview public, Anthropic launched Project Glasswing – named after the glasswing butterfly with its transparent wings (symbolizing the initiative's commitment to transparency and vulnerability disclosure).

    The 12 Founding Partners

    Project Glasswing brings together an unprecedented consortium:

    • Amazon Web Services
    • Apple
    • Broadcom
    • Cisco
    • CrowdStrike
    • Google
    • JPMorganChase
    • Linux Foundation
    • Microsoft
    • NVIDIA
    • Palo Alto Networks
    • Anthropic

    Plus over 40 additional organizations that build or maintain critical software infrastructure.

    The Investment

    • $100 million in usage credits for Mythos Preview
    • $4 million in direct donations to open-source security organizations

    Why This Is Strategically Significant

    1. Anthropic Redefines Its Safety Leadership

    Until now, Anthropic's safety narrative has been largely theoretical: Responsible Scaling Policy, Constitutional AI, alignment research. With Glasswing, Anthropic demonstrates a concrete, productive application of safety – one that creates real economic and security value.

    2. The Business Model Shifts

    A model that isn't publicly available but is licensed through controlled partnerships represents a new paradigm. Anthropic becomes the defense contractor of the digital age – with a product so powerful that its controlled deployment is itself a competitive advantage.

    3. The Cybersecurity Landscape Changes Fundamentally

    The core insight from the Frontier Red Team Blog: the same capabilities that make models better at fixing bugs also make them better at exploiting them. This means:

    • Short-term: Attackers could benefit if frontier labs aren't careful
    • Long-term: Defenders will be more efficient, finding and fixing bugs before code ever ships

    The transition period will be turbulent.

    What Companies Should Do Now

    Project Glasswing isn't an abstract research project – it has direct implications:

    Security teams should evaluate how AI-powered vulnerability scanning can be integrated into their workflows. If Mythos Preview finds bugs in every major OS, the next comparable model will find them in your software too.

    CTOs and CISOs need to reassess the threat landscape. The window between vulnerability discovery and exploit has collapsed from months to minutes.

    Open-source maintainers should explore access through the Linux Foundation – the initiative offers enterprise-grade security tools for projects that normally couldn't afford them.

    Our Take

    Claude Mythos Preview and Project Glasswing mark a turning point. Not because a single model delivers impressive benchmarks – but because Anthropic draws the institutional consequence from it.

    Choosing not to release a model because its capabilities are too dangerous, and instead launching an industry-wide defense initiative – that's a move we haven't seen before in the AI industry.

    The question is no longer whether AI will transform cybersecurity. The question is whether defenders are fast enough to leverage the head start that initiatives like Glasswing provide.

    For companies positioning themselves now, this is an enormous opportunity. For everyone else, the clock is ticking.

    TeilenLinkedInWhatsAppE-Mail

    Related Articles

    Claude Mythos Preview: Benchmarks, Exploit Chains, and the Technical Deep Dive
    April 11, 20267 min

    Claude Mythos Preview: Benchmarks, Exploit Chains, and the Technical Deep Dive

    Claude Mythos Preview isn't incrementally better – it's a different category. 93.9% on SWE-bench, 100% on Cybench, and e…

    Read more
    Editorial illustration of the Claude Design launch – warm sand-tone background with the rust-orange Claude spark motif, glassmorphic UI panels showing a wireframe, color tokens, and a dashboard mockup, with subtle Adobe-red and Figma-purple accents hinting at the market disruption.
    April 17, 20265 min

    Claude Design Is Here: How Anthropic Labs Wiped $30B Off Figma, Adobe and Wix in a Single Day

    On April 17, 2026, Anthropic launched Claude Design – the first Anthropic Labs product for visual work. Powered by Opus …

    Read more
    Claude Opus 4.7 Is Here: What Premium Teams Need to Know About the Tokenizer, xhigh, and Spend Controls
    April 17, 20265 min

    Claude Opus 4.7 Is Here: What Premium Teams Need to Know About the Tokenizer, xhigh, and Spend Controls

    Anthropic just released Claude Opus 4.7. Same price as 4.6, but noticeably better at coding, agents, and visual output. …

    Read more
    Chess pieces as a metaphor for the platform conflict between Anthropic and Lovable
    April 14, 20263 min

    Anthropic Is Building an App Builder – And It's Coming for Europe's Vibe-Coding Star Lovable

    Leaked screenshots reveal an integrated app builder inside Claude. What this means for Lovable, the European startup eco…

    Read more
    The AI Race in 31 Milestones: The Complete OpenAI vs. Anthropic Timeline
    April 11, 20262 min

    The AI Race in 31 Milestones: The Complete OpenAI vs. Anthropic Timeline

    From GPT-4o to Project Glasswing: Every acquisition, model launch, and product release from OpenAI and Anthropic on an i…

    Read more
    OpenAI Buys a TV Show. Anthropic Builds the Future of Software. And Google? It's Playing a Different Game Entirely.
    April 11, 20266 min

    OpenAI Buys a TV Show. Anthropic Builds the Future of Software. And Google? It's Playing a Different Game Entirely.

    OpenAI buys TBPN, a Jony Ive hardware startup, and builds a desktop superapp. Anthropic turns Claude into a Developer OS…

    Read more
    Why 🦞 Became the Secret Handshake of the Agentic AI Movement
    May 19, 20263 min

    Why 🦞 Became the Secret Handshake of the Agentic AI Movement

    How a crustacean became the tribal emoji of the agentic AI scene – from Anthropic memes to X bios full of lobster claws.…

    Read more
    Two robotic hands tearing a golden Claude Pro ticket in half while token coins spill out, with a rising price chart in the background
    April 22, 20265 min

    Claude Code Out of Pro: The End of the All-You-Can-Eat Era for Coding Agents

    Anthropic is removing Claude Code from the Pro plan. Cursor already moved to token-based pricing. Codex is likely next. …

    Read more
    Claude Managed Agents architecture – brain connected to multiple hands representing tools and sandboxes
    April 8, 20265 min

    Claude Managed Agents: Anthropic's Play to Own the Agent Runtime

    Anthropic launches Managed Agents in public beta – a hosted runtime that decouples the 'brain' from the 'hands.' No more…

    Read more