Anthropic slows public Mythos release over safeguard gap

AIMay 26, 2026 at 07:30 PM

Read full story

Source: Mashable

TLDR: SAN FRANCISCO—Anthropic says a public release of Mythos class cybersecurity models is coming in the near future, but current safeguards are not adequate. The model already found thousands of flaws during Project Glasswing work.

Key Takeaways:

Anthropic is running Mythos through Project Glasswing, a limited access security effort, and is expanding access to U.S. and allied governments first.
Anthropic admits it has not built safeguards sufficient to stop misuse, yet it still expects Mythos level models to go widely available within 6 to 12 months.
Mythos testing reportedly produced working exploits 72.4 percent of the time and identified 23,000 plus flaws, intensifying the zero day discovery crisis for bug bounties.

Mythos is already acting like a relentless security intern, then management panics about the onboarding paperwork. If safeguards lag, the real race shifts from code to containment.

No comments yet. Be the first to share your thoughts!

Mythos is already acting like a relentless security intern, then management panics about the onboarding paperwork. If safeguards lag, the real race shifts from code to containment.

Q&A

What happens if Mythos reaches the public before adequate safeguards do?

More people gain automated exploit discovery ability, which can accelerate both patching and malicious scanning. The gap Anthropic admits could widen the window for attackers to benefit.

Why does Anthropic plan government access before public release?

Government channels can pair powerful tooling with tighter operational controls, monitoring, and incident response workflows, buying time while Anthropic works toward stronger misuse prevention.

How do bug bounty cutoffs change incentives when AI floods reports?

When maintainers cannot triage the inflow, reward programs lose throughput and legitimacy. Attackers may still learn from patterns even if defenders cannot verify everything quickly.

Could slowing disclosure help, or does it just delay collective defense?

Slower disclosure can reduce maintainer overload, but it also risks longer exposure for users. The better path is triage automation plus clearer time boxed disclosure protocols.

What does the wolfSSL patch signal about defender and attacker timing?

It shows AI can find high impact weaknesses quickly enough to matter, but the defender patch also reflects how narrow the useful window is. The side with faster validation and deployment gains.