Skip to content

Safety Models

Safety models analyze the state of AI safety research, practices, and tradeoffs. They help understand what drives safety work and how to improve it.

  • Safety-Capability Tradeoff: Analyzing tradeoffs between safety and capability
  • Safety Culture Equilibrium: Dynamics of safety culture within AI labs
  • Safety Researcher Gap: Gap between safety researcher supply and demand
  • Capabilities-to-Safety Pipeline: How capabilities researchers transition to safety
  • Alignment Robustness Trajectory: Trajectory of alignment robustness as capabilities scale