
Model Evaluation for Extreme Risks
AI Safety Fundamentals
Limitations and Hazards of Model Evaluation for Extreme Risks
This chapter discusses the limitations and hazards of model evaluation as a tool for addressing extreme risks. It covers how risks depend on a model's interactions with the wider system, the difficulty of anticipating risk pathways and identifying the relevant model properties, scale effects, and the underdeveloped ecosystem for evaluations and audits.