
27 - AI Control with Buck Shlegeris and Ryan Greenblatt
AXRP - the AI X-risk Research Podcast
00:00
Exploring AI Actions and Speed
The chapter delves into scenarios where generating actions may be easier than checking them in AI systems. It discusses the challenges of verifying actions efficiently, highlighting the importance of additional artifacts for verification. The conversation also addresses the complexities of AI control, strategies to address vulnerabilities, and the balance between review and generation time.
Transcript
Play full episode