
Model Evaluation for Extreme Risks
AI Safety Fundamentals
00:00
Introduction
In this chapter, the authors highlight the significance of model evaluation in addressing extreme risks posed by AI systems. They emphasize the need for developers to identify dangerous capabilities and assess the propensity of models to cause harm, which informs risk assessments and ensures responsible training, deployment, transparency, and security.
Play episode from 00:00
Transcript


