
40 - Jason Gross on Compact Proofs and Interpretability
AXRP - the AI X-risk Research Podcast
00:00
Exploring Guaranteed Safe AI and the Importance of Output Safety
This chapter explores the concept of guaranteed safe AI, focusing on the desire to ensure the safety of AI system outputs. It examines the implications of a recent paper, debating the emphasis on the AI itself versus the generated code, highlighting the need for predictability in high-stakes situations.
Transcript
Play full episode