Anthropic opens Mythos model, but guards cybersecurity access

AIJune 10, 2026 at 12:30 AM

Anthropic opens Mythos model, but guards cybersecurity access

Read full story

Source: SCMP

TLDR: WASHINGTON—Anthropic releases public Claude Fable 5, built on Mythos, but blocks requests that try to find cybersecurity vulnerabilities, including package or code probes. The move widens access beyond Glasswing, affecting developers, researchers, and governments watching AI security risks.

Key Takeaways:

Mythos access stayed mostly inside Anthropic's Glasswing network of about 200 organizations, including the US government, after thousands of vulnerabilities were reported.
Anthropic says Claude Fable 5 is its most powerful wider model, and it will refuse cybersecurity requests and fall back to Opus 4.8.
Wider rollout could boost Anthropic's momentum in the AI race, while stricter refusal rules aim to limit misuse that turns AI into an exploit finder.

Anthropic is turning Mythos outward, but drawing a bright line under hacking requests. The interesting tension is how quickly public curiosity meets private guard rails.

No comments yet. Be the first to share your thoughts!

Anthropic is turning Mythos outward, but drawing a bright line under hacking requests. The interesting tension is how quickly public curiosity meets private guard rails.

Q&A

If the model refuses to help find vulnerabilities, will attackers shift to asking for adjacent tasks like code review patterns or exploit explainers?

Safeguards can block explicit vulnerability hunting, but adversaries often try to reframe requests. The key will be whether Anthropic can prevent indirect jailbreak paths.

What does fallback to Opus 4.8 reveal about Anthropic's evaluation priorities?

It suggests Anthropic is optimizing for containment under risky prompts, not only best possible answers. The system becomes safer by downgrading capability when it detects misuse.

How might the Glasswing group push the public model harder than regulators ever could?

Organizations with early access often pressure models with real workflows, edge cases, and internal red teaming. That feedback can quickly harden guard rails or expose new bypass angles.

Will public availability increase the odds that security researchers find both vulnerabilities and gaps in Anthropic's safeguards?

Yes. More users means more attempts to test boundaries, which can improve defenses through disclosure but also accelerate discovery of weak spots.

Historically, why do cybersecurity tool restrictions fail to fully stop misuse despite safeguards?

Restrictions can reduce obvious pathways, but determined users improvise through alternative tooling and prompt engineering. Long term, success depends on monitoring, policy enforcement, and continuous model updates.

Anthropic opens Mythos model, but guards cybersecurity access

Key Takeaways:

Q&A

If the model refuses to help find vulnerabilities, will attackers shift to asking for adjacent tasks like code review patterns or exploit explainers?

What does fallback to Opus 4.8 reveal about Anthropic's evaluation priorities?

How might the Glasswing group push the public model harder than regulators ever could?

Will public availability increase the odds that security researchers find both vulnerabilities and gaps in Anthropic's safeguards?

Historically, why do cybersecurity tool restrictions fail to fully stop misuse despite safeguards?

Top in AI

Kingdom Hearts fans pressure Square Enix over box art

Art Directors Guild hits Scorsese over FLUX AI partnership

Mythos Claude Fable makes users feel powerless

German court pressures Google: AI Overviews carry direct liability

Vision Pro gets gaze snapshot Siri in VisionOS 27