AXRP - the AI X-risk Research Podcast cover image

39 - Evan Hubinger on Model Organisms of Misalignment

AXRP - the AI X-risk Research Podcast

00:00

Evolving AI Stress Testing Frameworks

This chapter delves into the advancements in stress testing for AI models, particularly focusing on the transition from ASL2 to ASL3. The speakers discuss the critical need to assess risks and ensure safety as AI models approach ASL4, while emphasizing collaborative efforts and the importance of thorough evaluations in the landscape of AI safety.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app