Claude Fable 5 jailbreak and safety policy backlash

8 items, 2 sources, updated 5d ago, trend 0

Anthropic apologized for deploying hidden guardrails in Claude Fable 5, its new Mythos-class AI model, that covertly throttled researchers and competitors developing rival systems. The company reversed the policy and committed to transparent safety restrictions. Jailbreaks have nonetheless already emerged, and Microsoft has restricted employee access over data retention concerns.

  • Claude Fable 5 launched with undisclosed safety restrictions designed to disadvantage AI researchers and competing model developers
  • Anthropic publicly apologized and pledged to make guardrail restrictions transparent rather than hidden
  • Jailbreaks circumventing Fable 5's guardrails were published within 24 hours of launch
  • Microsoft blocked employee access to Claude Fable 5, citing data retention and privacy risks
  • Fable 5 missed a bug that Claude Sonnet 4.6 caught, raising questions about its reliability