Anthropic Restores Access to Claude Fable 5 with New Safety Mechanisms

Anthropic is resuming access to the Claude Fable 5 model starting July 1, following the lifting of US export restrictions. The model's return is accompanied by the implementation of a strict query classification system to prevent cyber threats.

What Happened

As of July 1, users can once again use Claude Fable 5. To ensure safety, Anthropic has implemented a new classifier capable of rerouting complex or potentially dangerous queries to the Opus 4.8 model. Until July 7, Pro, Team, Max, and Premium Enterprise subscribers will be able to use Fable 5 within 50% of their weekly limits, after which credit usage will be required.

Context

Access to the model was temporarily restricted following a report from researchers at Amazon. It was discovered that Claude Fable 5 possessed the ability to find and demonstrate exploits for software vulnerabilities. This incident drew interest from US regulators and highlighted the risks of using powerful models as tools for cyberattacks.

Why It Matters for the Industry

This case demonstrates the high speed of regulatory response to cybersecurity risks through export control tools. The situation is stimulating the development of new industry standards for assessing the severity of "jailbreaks" and red-teaming protocols involving major players such as Amazon, Google, and Microsoft. There is also a visible shift toward classifier-based routing architectures as a standard for enterprise-grade AI products.

Why It Matters for Users

Claude users should be aware that new protection mechanisms may lead to an increase in false positives. This means that harmless queries related to code writing or debugging could be mistakenly blocked or rerouted to the less powerful Opus 4.8 model, which may affect the stability and quality of responses.

What Is Not Yet Known / Limitations

There is uncertainty regarding how exactly the new classifier will affect latency and the overall cost of inference when using dynamic routing.

Sources

Author

Look at AI, Editorial Team