Responsible Scaling in Practice - DeepMind FSF vs Anthropic RSP vs OpenAI Preparedness (2025)

Nov 14, 2025
2 min read

Scaling drives capability jumps; leading labs now publish thresholded safety policies. Using them as templates will lift your governance to frontier-grade.

What does each framework require?

Control theme	DeepMind FSF (2025)	Anthropic RSP / ASL	OpenAI Preparedness v2 (Apr 15, 2025)
Triggering events	Critical Capability Levels (CCLs) incl. deceptive-alignment risk; stronger security by CCL	Capability Thresholds → escalate to ASL-3 safeguards	Tracked Categories (Bio/Chem, Cyber, AI self-improvement) with risk scoring
Security posture	Heightened exfiltration controls at higher CCLs	ASL-3 Security & Deployment Standards	Secure development & deployment requirements, isolation patterns
Evaluations	Consistent procedure; pre-deployment mitigations	External red-teaming; evals tied to thresholds	Preparedness eval batteries; SAG review; system cards
Deployment constraints	Mitigation playbooks per capability	Constraints at ASL-3 (rate limits, narrower access)	Release gates; block/allow-with-constraints based on scores
Transparency	Public framework iterations	Public RSP + updates	Public system cards (e.g., o3/o4-mini)

Sources: OpenAI+Google+DeepMind

Implementation checklist for enterprises

Define your thresholds. Borrow FSF/ASL/Preparedness triggers; customize to your sector.
Create a release gate. Require: recent eval results, red-team evidence, mitigations, decision owner sign-off.
Align with EU AI Act. Keep a technical file: intended purpose, data sources, known limits, monitoring plan. From Aug 2, 2025, GPAI duties apply to providers—your procurement should verify them.
Document constraints. Rate limits, tool-use isolation, PII handling, logging and rollback procedures.
Schedule re-evals. At every model upgrade and quarterly for critical services; include supplier updates (system cards).