The Relationship Between Corrigibility and Existential Risk

There are limits to the amount of information that you should get in a co operative, irel optimal equilibriumrht. The solution does not involve fully identifying the person's utility function. There are unavoidable costs in them running their brain to generate that information. Those costs could be like direct time costs, but also indirect psychological costs. That idea of normative information and fundamental limits on it is what drives our concerns about existential risk.

Play episode from 01:20:09

Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app