
#10: Stephen Casper on Technical and Sociotechnical AI Safety Research
Center for AI Policy Podcast
Understanding AI Alignment and Deceptive Alignment
This chapter explores AI alignment and why it matters for ensuring that AI systems act according to user intentions. It highlights the problem of deceptive alignment, in which an AI system may appear safe but could behave unpredictably in practice.
00:00