Four: Brian Christian on artificial intelligence

Effective Altruism: Ten Global Problems – 80000 Hours

How Can You Learn From Your Own Estimate?

There's been, you know, a series of breakthroughs starting in, really, the 1970s. There's an idea called temporal difference learning, which says, rather than waiting till you actually get the reward, you can learn from your own estimate changing. Increasingly, social media companies like Facebook are using reinforcement learning to model how they send notifications out. Yeah, and I guess, how does reinforcement learning work?
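
The idea of learning "from your own estimate changing" can be illustrated with a minimal sketch of tabular TD(0) value estimation. This is an illustration rather than anything described in the episode: the five-state random-walk task, the function name `td0_random_walk`, and every parameter value are assumptions chosen for the example. Each update nudges the current state's value toward the reward just received plus the estimate for the next state, so learning happens at every step instead of only when the final reward arrives.

```python
import random


def td0_random_walk(num_states=5, episodes=5000, alpha=0.02, gamma=1.0, seed=0):
    """Estimate state values on a small random walk with TD(0).

    Hypothetical toy task: states 0..num_states-1 sit in a row. Each step
    moves left or right with equal probability. Stepping off the right end
    yields reward 1, stepping off the left end yields reward 0.
    """
    rng = random.Random(seed)
    values = [0.5] * num_states  # initial guesses for every state

    for _ in range(episodes):
        state = num_states // 2  # every episode starts in the middle
        while True:
            next_state = state + rng.choice((-1, 1))
            if next_state < 0:
                reward, next_value, done = 0.0, 0.0, True
            elif next_state >= num_states:
                reward, next_value, done = 1.0, 0.0, True
            else:
                reward, next_value, done = 0.0, values[next_state], False

            # TD error: how much the estimate changed after one more step.
            td_error = reward + gamma * next_value - values[state]
            values[state] += alpha * td_error

            if done:
                break
            state = next_state

    return values


if __name__ == "__main__":
    for index, value in enumerate(td0_random_walk()):
        print(f"state {index}: estimated value {value:.2f}")
```

For this toy task, the exact value of state i is (i + 1) / 6, so the printed estimates should climb from left to right. The step size `alpha` is kept constant here for simplicity; a decaying step size is another common choice.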
