#10: Stephen Casper on Technical and Sociotechnical AI Safety Research

Center for AI Policy Podcast

CHAPTER

Understanding AI Alignment and Deceptive Alignment

This chapter explores AI alignment and why it matters for ensuring that AI systems act according to user intentions. It highlights the problem of deceptive alignment, in which an AI system may appear safe but behave unpredictably in practice.

