27 - AI Control with Buck Shlegeris and Ryan Greenblatt

AXRP - the AI X-risk Research Podcast

Exploring AI Actions and Speed

This chapter examines scenarios in which an AI system can generate actions more easily than those actions can be checked. It covers the difficulty of verifying actions efficiently and the importance of additional artifacts that support verification. The conversation also addresses the complexities of AI control, strategies for addressing vulnerabilities, and the trade-off between review time and generation time.
