metr.org
Cited By (21 articles)
- Autonomous Coding
- Persuasion and Social Manipulation
- Accident Risk Cruxes
- Large Language Models
- Lab Behavior
- Capability Threshold Model
- Defense in Depth Model
- Intervention Effectiveness Matrix
- Mesa-Optimization Risk Analysis
- Risk Activation Timeline Model
- Risk Cascade Pathways Model
- Warning Signs Model
- METR
- AI Evaluations
- Technical AI Safety Research
- Corporate Responses
- AI Evaluation
- Emergent Capabilities
- Sycophancy
- AI Proliferation
- Racing Dynamics