←── back to feed
/topics/claude-fable-5-jailbreak-and-safety-policy-backlash
Claude Fable 5 jailbreak and safety policy backlash
8 items●2 sources●updated 5d ago●trend 0
Anthropic apologized for deploying hidden guardrails in Claude Fable 5, its new Mythos-class AI model, that covertly throttled researchers and competitors developing rival systems. The company reversed the policy and committed to transparent safety restrictions, though jailbreaks have already emerged and Microsoft restricted employee access over data retention concerns.
- Claude Fable 5 launched with undisclosed safety restrictions designed to disadvantage AI researchers and competing model developers
- Anthropic publicly apologized and pledged to make guardrail restrictions transparent rather than hidden
- Jailbreaks circumventing Fable 5's safety nets were published within 24 hours of launch
- Microsoft blocked employee access to Claude Fable 5 citing data retention and privacy risks
- Fable 5 missed a bug that Claude Sonnet 4.6 successfully caught, raising reliability questions
[HN]hacker news7
Anthropic Walks Back Policy That Could Sabotage AI Researchers Using Claude
Anthropic Walks Back Policy That Could Have 'Sabotaged' Researchers Using Claude
Anthropic's Fable Jailbreak (Circumvent safety nets)
Claude Fable 5 jailbroken to bypass Anthropic's new safety guardrails
Claude Fable 5 missed a bug that Sonnet 4.6 caught
Microsoft restricts Claude Fable for employees over data retention concerns
Claude Fable 5 System prompt