
8 - Assistance Games with Dylan Hadfield-Menell
AXRP - the AI X-risk Research Podcast
00:00
The Relationship Between Corrigibility and Existential Risk
There are limits to the amount of information that you should get in a co operative, irel optimal equilibriumrht. The solution does not involve fully identifying the person's utility function. There are unavoidable costs in them running their brain to generate that information. Those costs could be like direct time costs, but also indirect psychological costs. That idea of normative information and fundamental limits on it is what drives our concerns about existential risk.
Play episode from 01:20:09
Transcript


