AXRP - the AI X-risk Research Podcast cover image

39 - Evan Hubinger on Model Organisms of Misalignment

AXRP - the AI X-risk Research Podcast

CHAPTER

Evolving AI Stress Testing Frameworks

This chapter delves into the advancements in stress testing for AI models, particularly focusing on the transition from ASL2 to ASL3. The speakers discuss the critical need to assess risks and ensure safety as AI models approach ASL4, while emphasizing collaborative efforts and the importance of thorough evaluations in the landscape of AI safety.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner