Four: Brian Christian on artificial intelligence

Effective Altruism: Ten Global Problems – 80000 Hours

CHAPTER

Is There a Reward Function?

The hope is that this would cause them to become corrigible. This means being willing to be corrected because they've made a mistake. So it's like "correctable", I guess. Existing traditional systems, by contrast, may be overconfident and resistant to correction or to being turned off. And so people like Stuart Russell, for example, have been thinking about how we can incorporate some notion of uncertainty or doubt.
