
Can Defense in Depth Work for AI? (with Adam Gleave)
Future of Life Institute Podcast
00:00
Scalable Oversight to Reduce Deception
Nathan asks about deception; Adam presents experiments training lie detectors and scalable oversight to lower deception rates and caveats about training regimes.
Transcript
Play full episode