
Story: Reinforcement Learning At Facebook with Jason Gauci
CoRecursive: Coding Stories
00:00
How to Train a Go Game
This is basically back propagation. It's a form of supervised learning. Your training the nerlnet by supervising it and telling it, you know, when it gets the wrong answer what it should have gotten instead. Inds do this, you need to know what the right answer is so that you can train it. But for puzzles and games, for example, you don't have that. So look at go to this day, people haven't found the perfect go game,. And so you don't have and so you have to do something different. You have to learn from experience.
Transcript
Play full episode