
39 - Evan Hubinger on Model Organisms of Misalignment
AXRP - the AI X-risk Research Podcast
00:00
Evolving AI Stress Testing Frameworks
This chapter covers advances in stress testing for AI models, focusing on the transition from ASL-2 to ASL-3. The speakers discuss the need to assess risks and ensure safety as models approach ASL-4, and emphasize collaboration and thorough evaluations in AI safety work.
Transcript