
11 - Attainable Utility and Power with Alex Turner
AXRP - the AI X-risk Research Podcast
The Relationship Between Corrigeibility and Utility Preservation
i want to talk about the relationship between attainable utility preservation and cordiability. Andayf people are more interested in this a inner alignment issue, i encourage them to listen to the episode with even hubing er. I think some variance of it and some settings will help a lot with off switch corrigeability. There's an idea that you have n system that like, is amenable to being corrected by you, and like, doesn't stop you from trying to correct it. It seems like a less serious thing to not be able to do at this appointed time, do not know how to do.
00:00
Transcript
Play full episode
Remember Everything You Learn from Podcasts
Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.