
Story: Reinforcement Learning At Facebook with Jason Gauci

CoRecursive: Coding Stories


How to Train a Go Game

This is basically backpropagation. It's a form of supervised learning. You're training the neural net by supervising it and telling it, you know, when it gets the wrong answer, what it should have gotten instead. To do this, you need to know what the right answer is so that you can train it. But for puzzles and games, for example, you don't have that. Look at Go: to this day, people haven't found the perfect Go game. So you don't have the right answer, and you have to do something different. You have to learn from experience.
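The supervised setup described above can be sketched in a few lines. This is a minimal illustration, not code from the episode: it trains a single sigmoid neuron (the simplest case of backpropagation) on the logical OR function, where the right answer for every input is known. The names `train`, `predict`, and `OR_EXAMPLES`, and the learning rate and epoch count, are assumptions chosen for the example.

```python
import math


def sigmoid(z):
    """Squash a real number into the range (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def predict(w, b, x):
    """Output of a single neuron with two inputs."""
    return sigmoid(w[0] * x[0] + w[1] * x[1] + b)


def train(examples, epochs=2000, lr=0.5):
    """Fit the neuron by gradient descent on labeled examples.

    Each example is (inputs, target). The target is the "right answer"
    that supervision relies on.
    """
    w = [0.0, 0.0]
    b = 0.0
    for _ in range(epochs):
        for x, target in examples:
            pred = predict(w, b, x)
            # How wrong the net was: for a sigmoid output with
            # cross-entropy loss, this is the gradient at the neuron.
            error = pred - target
            # Nudge each weight against its share of the error.
            w[0] -= lr * error * x[0]
            w[1] -= lr * error * x[1]
            b -= lr * error
    return w, b


# Labeled data: every input comes with the answer it should produce.
OR_EXAMPLES = [((0, 0), 0), ((0, 1), 1), ((1, 0), 1), ((1, 1), 1)]

w, b = train(OR_EXAMPLES)
for x, target in OR_EXAMPLES:
    print(x, target, round(predict(w, b, x), 3))
```

The update only works because each example carries a `target`. In a game like Go there is no known correct move to plug in there, which is why a different, experience-based approach is needed.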

