Chapters
Transcript
Episode notes
1 2 3 4 5 6 7
Introduction
00:00 • 2min
How to Craft a Good Governance Scheme Around Model Evaluations
02:16 • 2min
Behavioral Non-Faint Tuning Evaluations
04:41 • 3min
The Importance of Capabilities Evaluations
07:13 • 2min
The Importance of RL in Exploration Hacking
09:21 • 2min
How to Make Behavioral IID Fine-Tuning Evaluations Trustworthy
11:35 • 2min
Gradient Hacking and the Superhuman Capacity Regime
13:55 • 4min


