AXRP - the AI X-risk Research Podcast cover image

8 - Assistance Games with Dylan Hadfield-Menell

AXRP - the AI X-risk Research Podcast

00:00

The Relationship Between Corrigibility and Existential Risk

There are limits to the amount of information that you should get in a co operative, irel optimal equilibriumrht. The solution does not involve fully identifying the person's utility function. There are unavoidable costs in them running their brain to generate that information. Those costs could be like direct time costs, but also indirect psychological costs. That idea of normative information and fundamental limits on it is what drives our concerns about existential risk.

Play episode from 01:20:09
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app