AI Safety 🔥 Top

Anthropic Introduces CJS Jailbreak Severity Scale with Amazon, Microsoft, and Google

Friday, July 3, 2026 • Source: Anthropic

What happened

On July 2, 2026, Anthropic published technical details about Fable 5's cybersecurity protections and proposed the CJS (Cybersecurity Jailbreak Severity) scale — a cross-industry standard for evaluating AI jailbreak severity, developed jointly with Amazon, Microsoft, Google, and Glasswing ecosystem partners.

Context and impact

No unified standard previously existed for comparing jailbreak severity across AI companies. The CJS scale enables coordinated industry response and prioritized remediation. Alongside Fable 5, Anthropic also launched a HackerOne program for reporting jailbreaks.

Details

CJS-0: Informational (no actionable capability)
CJS-1 through CJS-3: Moderate levels with exponential severity increases
CJS-4: Critical (direct critical infrastructure impact)
Four scoring axes: capability gain, breadth of capability, ease of weaponization, discoverability
Tier taxonomy: Prohibited / High-risk dual-use / Low-risk dual-use / Benign use
New HackerOne program for security researchers to report jailbreaks
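
To make the structure listed above concrete, here is a minimal sketch of how a triage tool might represent a CJS-rated report. Everything beyond the level names, the four axis names, and the four tier names is an assumption made for illustration: the `JailbreakReport` record, its field names, and the `triage_order` helper are hypothetical. The source does not say how the four axes combine into a level, so the level here is simply assigned by a reviewer rather than computed.

```python
from dataclasses import dataclass
from enum import Enum, IntEnum


class CJSLevel(IntEnum):
    """Severity levels named in the announcement (CJS-0 to CJS-4)."""
    CJS_0 = 0  # Informational: no actionable capability
    CJS_1 = 1  # Moderate
    CJS_2 = 2  # Moderate
    CJS_3 = 3  # Moderate
    CJS_4 = 4  # Critical: direct critical infrastructure impact


class Tier(Enum):
    """Tier taxonomy named in the announcement."""
    PROHIBITED = "Prohibited"
    HIGH_RISK_DUAL_USE = "High-risk dual-use"
    LOW_RISK_DUAL_USE = "Low-risk dual-use"
    BENIGN_USE = "Benign use"


# The four scoring axes; the snake_case keys are an assumption.
AXES = (
    "capability_gain",
    "breadth_of_capability",
    "ease_of_weaponization",
    "discoverability",
)


@dataclass(frozen=True)
class JailbreakReport:
    """Hypothetical record layout, not a published schema."""
    title: str
    level: CJSLevel             # assigned by a reviewer, not derived
    tier: Tier                  # attaching a tier to a report is an assumption
    axis_notes: dict[str, str]  # free-text reviewer notes keyed by axis

    def __post_init__(self) -> None:
        # Reject notes filed under an axis the scale does not define.
        unknown = set(self.axis_notes) - set(AXES)
        if unknown:
            raise ValueError(f"unknown scoring axes: {sorted(unknown)}")


def triage_order(reports: list[JailbreakReport]) -> list[JailbreakReport]:
    """Return reports with the most severe CJS level first."""
    return sorted(reports, key=lambda r: r.level, reverse=True)
```

Sorting by level is one simple way to reflect the prioritized remediation the scale is meant to support. A real workflow would likely weigh other factors as well.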
