A stylized five made of butterflies – visual for Claude Fable 5

    Claude Fable 5 & Mythos 5: When AI Shifts from Tasks to Responsibilities

    9. Juni 20266 min read
    Till Freitag

    TL;DR: „Anthropic launched Claude Fable 5 (generally available) and Mythos 5 (Project Glasswing) – both Mythos-Class models. Fable 5 is SOTA on nearly all benchmarks, especially on long-horizon agent tasks. The real paradigm shift: AI no longer gets individual tasks, but permanent responsibilities."

    — Till Freitag

    What Anthropic Launched Today

    On June 9, 2026, Anthropic announced Claude Fable 5 and Claude Mythos 5 – two configurations of the same Mythos-Class model.

    Model Access Price (In/Out per 1M Tokens)
    Fable 5 Paid Plans, generally available $10 / $50
    Mythos 5 Project Glasswing (Cyberdefender, US Government), Trusted Access Program follows $10 / $50

    Both run on the same underlying model. Mythos 5 has fewer safeguards in certain areas – specifically for cybersecurity defense workflows. Fable 5 routes sensitive topics to Opus 4.8 as a fallback (Anthropic says this happens in under 5% of sessions).

    Pricing-wise: less than half of Claude Mythos Preview. That's the actually aggressive move.

    The Numbers (Briefly)

    • SOTA on nearly all tested benchmarks – the longer and more complex the task, the clearer the lead over Opus 4.8.
    • Stripe: Codebase-wide migration in a 50-million-line Ruby codebase – one day instead of two months of team work.
    • Cognition FrontierCode: Highest score among all frontier models, even at "Medium Effort" (i.e., token-efficient).
    • Hebbia Finance Benchmark: Top score for senior-level reasoning with documents, charts, problem solving.
    • Vision: Beats Pokémon FireRed with pure vision harness – no maps, no game-state helpers.
    • Long Context: Stays focused over millions of tokens; persistent file-based memory triples performance vs. Opus 4.8 (measured on Slay the Spire).
    • Drug Design (Mythos 5): Internal Anthropic experts accelerated aspects of the drug design process by ~10×. 9 of 14 protein targets yielded strong drug candidates.
    • Genomics: Mythos 5 autonomously trained an ML model in one week that beats a model published in Science – at 100× smaller size.

    That's the obligatory part. The subtext is more interesting.

    The Real Shift: From Tasks to Responsibilities

    Felix Rieseberg (Engineering Lead for Claude Code & Cowork) sums it up in a LinkedIn post at launch:

    "I no longer tell Claude to investigate a particular crash report. It runs in a loop, watching every crash report that comes in. Its job is no longer to help me fix a crash, it's to keep our apps from crashing."

    This is the third era, which Rieseberg says begins "quietly":

    1. Era 1 – Answers: ChatGPT as a better search engine, code autocomplete.
    2. Era 2 – Tasks: You give the model a concrete problem ("fix this bug", "write this document"). Human always in the loop.
    3. Era 3 – Responsibilities: You give the model a permanent responsibility. It decides when to become active, what to work on, when to escalate.

    Dan Shipper – founder of Every and one of the most prominent voices tracking the AI product stack – also weighed in from the perspective of a builder who works with these tools daily:

    Dan Shipper also commented on the launch on LinkedIn, emphasizing that Anthropic with Fable 5 delivers not just a faster model, but a new standard for Agentic AI: The time between "idea" and "running system" shrinks from weeks to hours for productivity builders.

    This aligns exactly with the picture we painted ten weeks ago in From Chat to Workflow: Anthropic's Operational AI – Dispatch, Computer Use, persistent tasks. Fable 5 is the model layer that now truly makes this architecture viable.

    Why This Matters for Builders

    When AI no longer "processes tasks" but owns responsibilities, the product stack changes fundamentally:

    Old (Task Mode) New (Responsibility Mode)
    User clicks "Start Analysis" Agent runs as a daemon, observes events
    Output lands in chat window Output lands as PR, ticket, calendar event, Slack message
    Session-based (context dies with tab) Persistent (file-based memory, Notes, Recall)
    Success = "response was helpful" Success = "KPI moved" (Crashes ↓, Pipeline ↑)

    This has three hard consequences for products we build today:

    1. Observability becomes a product layer. When an agent runs permanently, you need audit logs, replay, cost caps – not just at enterprise scale, but from day 1.
    2. Permissions become granular. "Claude may fix bug X" is not the same as "Claude is responsible for app stability." The latter needs clear guardrails, escalation paths, and rollback strategies.
    3. The tool layer beats the model layer. Anyone already using Cursor, Claude Code, Vibe, or similar agent frontends gets the Fable 5 leap essentially for free. Anyone still building their own wrappers has to touch integration code with every new generation (see our analysis of the Agentic Coding Tool Stack →).

    What Does "Mythos-Class" Mean?

    Anthropic has quietly rebuilt the model name stack. Previously, "Opus" was the top tier. With Mythos Preview, a new height tier was introduced – now released in two productive variants:

    • Fable 5 = Mythos-Class model with safeguards for general use.
    • Mythos 5 = Mythos-Class model with relaxed safeguards for trusted cyberdefenders under Project Glasswing.
    • Opus 4.8 = Standard workhorse underneath, catches sensitive Fable 5 queries.

    Strategically interesting: Anthropic splits the tier not primarily by power, but by safety posture and access model. Mythos is reserved for defenders and critical infrastructure – a move that institutionalizes the "Defender's Advantage" argument.

    What We're Doing Concretely

    At Till Freitag, we're rolling out Fable 5 today in three contexts:

    1. Claude Code as standard engine for our internal refactoring and migration jobs. Stripe's number (months → days) roughly matches what we see with larger monday.com apps.
    2. Long-running agents on Railway for CRM hygiene tasks (lead deduplication, pipeline cleanup, activity summaries). Here, the long-context improvement matters more than any benchmark.
    3. Vision-first workflows for screenshot-to-spec pipelines in our vibe-coding practice – specifically for UI migrations.

    Mythos 5 is currently not relevant for us – Project Glasswing is restricted to cyberdefense partners. For regulated EU workloads, our preferred path remains Mistral 3 / Medium 3.5.

    What to Do Now

    1. If you use Claude Code, Cursor, or Vibe: Updating to Fable 5 is trivial – usually just a model switch in settings. Especially worthwhile for long tasks.
    2. If you build your own agents: Consider whether you want to make the jump from "task" to "responsibility." This is an architecture decision, not prompt tuning.
    3. If you think in pricing models: $10 / $50 is aggressive. Anyone currently running on Opus 4.5 / 4.8 should recalculate costs – especially for agentic workloads with high output volume.
    4. If you work in regulated industries: Wait for the expanded Mythos Trusted Access Program – or evaluate Mistral as an EU alternative in parallel.

    Conclusion

    The SOTA numbers will be relativized in two weeks – that's 2026 reality. What remains is the architecture move: Anthropic is consistently building the model and tool layer for permanent responsibilities, not one-off tasks.

    Anyone building an AI product in 2027 that still looks like a chat window will feel it. Anyone who starts thinking of workflows as daemons rather than prompts gets a multi-year head start.

    Felix Rieseberg is right: Today was a quiet start to a third era.


    → Anthropic's Operational AI: From Chat to Workflow → Mistral 3 & Vibe: Europe's AI Hope Back in the Game → Railway + Claude Code: Deployment per Prompt → Our AI Services at Till Freitag

    TeilenLinkedInWhatsAppE-Mail

    Related Articles

    Claude Mythos & Project Glasswing: When AI Gets Too Good at Hacking, It Becomes the Defenders' Weapon
    April 11, 20264 min

    Claude Mythos & Project Glasswing: When AI Gets Too Good at Hacking, It Becomes the Defenders' Weapon

    Anthropic's new frontier model Claude Mythos Preview is so good at finding vulnerabilities that it won't be publicly rel…

    Read more
    OpenClaw Pricing Shock: How to Avoid the $500 Bill
    April 5, 20262 min

    OpenClaw Pricing Shock: How to Avoid the $500 Bill

    Anthropic just killed third-party tool coverage under Claude subscriptions. If you're running OpenClaw without prep, you…

    Read more
    From Chat to Workflow: How Anthropic Is Turning Claude Into a Digital Coworker
    March 30, 20262 min

    From Chat to Workflow: How Anthropic Is Turning Claude Into a Digital Coworker

    Dispatch, Computer Use, persistent tasks – Anthropic is layering capabilities in an order that's no accident. A strategi…

    Read more
    Why We Switched from ChatGPT to Claude – and What We Learned About LLMs Along the Way
    February 20, 20265 min

    Why We Switched from ChatGPT to Claude – and What We Learned About LLMs Along the Way

    We worked with ChatGPT for 18 months – then switched to Claude. Here's our honest comparison of all major LLMs and why C…

    Read more
    Visualization of a large pale neural network sphere and a smaller bright sphere in cyan/yellow – the shrinking frontier of open models
    June 8, 20265 min

    Nex-N2-Pro: How the Open-Model Frontier Shrunk 75 % in Six Weeks

    Six weeks ago, DeepSeek-V4-Pro with 1.6 trillion parameters was the largest open-weight model ever released. Today, Nex-…

    Read more
    Why 🦞 Became the Secret Handshake of the Agentic AI Movement
    May 19, 20263 min

    Why 🦞 Became the Secret Handshake of the Agentic AI Movement

    How a crustacean became the tribal emoji of the agentic AI scene – from Anthropic memes to X bios full of lobster claws.…

    Read more
    Visualization of Kimi K2.6 long-horizon agents: a Moonshot crescent symbol alongside distributed sub-agent nodes over a coordination gridDeep Dive
    April 21, 20268 min

    Kimi K2.6: The Most Interesting AI Optimization in 2026 Isn't Intelligence – It's Duration

    Moonshot AI open-sourced Kimi K2.6 yesterday. 1 trillion parameters, 300 sub-agents, 13 hours of autonomous code refacto…

    Read more
    Editorial illustration of the Claude Design launch – warm sand-tone background with the rust-orange Claude spark motif, glassmorphic UI panels showing a wireframe, color tokens, and a dashboard mockup, with subtle Adobe-red and Figma-purple accents hinting at the market disruption.
    April 17, 20265 min

    Claude Design Is Here: How Anthropic Labs Wiped $30B Off Figma, Adobe and Wix in a Single Day

    On April 17, 2026, Anthropic launched Claude Design – the first Anthropic Labs product for visual work. Powered by Opus …

    Read more
    Claude Opus 4.7 Is Here: What Premium Teams Need to Know About the Tokenizer, xhigh, and Spend Controls
    April 17, 20265 min

    Claude Opus 4.7 Is Here: What Premium Teams Need to Know About the Tokenizer, xhigh, and Spend Controls

    Anthropic just released Claude Opus 4.7. Same price as 4.6, but noticeably better at coding, agents, and visual output. …

    Read more