The Nonlinear Library cover image

AF - AI Control: Improving Safety Despite Intentional Subversion by Buck Shlegeris

The Nonlinear Library

00:00

Preventing Catastrophic Failures Caused by Untrusted Models

Exploring the importance of a robust auditing process to prevent concentrated catastrophes caused by untrusted models and the necessity for scalable oversight techniques to counteract potential failures.

Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner
Get the app