
39 - Evan Hubinger on Model Organisms of Misalignment
AXRP - the AI X-risk Research Podcast
Evolving AI Stress Testing Frameworks
This chapter covers how stress testing of AI models is evolving, with a focus on the transition from ASL-2 to ASL-3. The speakers discuss the need to assess risks and ensure safety as models approach ASL-4, and they emphasize collaboration and thorough evaluations in AI safety work.