#10: Stephen Casper on Technical and Sociotechnical AI Safety Research

Center for AI Policy Podcast

Understanding AI Alignment and Deceptive Alignment

This chapter explores AI alignment and why it matters for ensuring that AI systems act according to their users' intentions. It also highlights the problem of deceptive alignment, in which an AI system appears safe but may behave unpredictably in practice.
