38.5 - Adrià Garriga-Alonso on Detecting AI Scheming

AXRP - the AI X-risk Research Podcast

Decoding AI Scheming

This chapter examines the nuanced behaviors of AI as it adapts to align with human intentions and the implications of such reactions. It addresses the challenges in detecting deceptive actions and understanding how AI's long-term goals can influence its interactions with developers and users.
