Reward functions in Reinforcement Learning (RL)
In your book, you talk about finding the right mix and match of reward functions. I think it's useful to ask what we should actually optimize for: what are the objectives in the world that we're trying to achieve? In RL, the reward is everything, since it may be the only part of the state we can observe. That's very useful, because it tells us when we're being efficacious and when we're not, which is really important. But we also need to learn some things intrinsically; we need, for example, to regulate our own mental health. And similarly, policies must be able to teach people
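The "mix and match of reward functions" mentioned above can be sketched minimally as a weighted sum of an extrinsic (environment-provided) reward and an intrinsic (agent-internal) one. This is an illustrative assumption, not the speaker's specific formulation; the function name and weights are hypothetical.

```python
def combined_reward(extrinsic: float, intrinsic: float,
                    w_ext: float = 1.0, w_int: float = 0.1) -> float:
    """Mix two reward signals into one scalar for an RL agent.

    extrinsic: reward observed from the environment (e.g. task success).
    intrinsic: agent-internal signal (e.g. a curiosity or well-being bonus).
    The weights control the trade-off between the two objectives.
    """
    return w_ext * extrinsic + w_int * intrinsic

# Example: a task reward of 1.0 plus a small intrinsic bonus of 0.5,
# down-weighted so the extrinsic objective dominates.
print(combined_reward(1.0, 0.5))
```

In practice the weighting itself is a design decision: set `w_int` too high and the agent chases its internal bonus instead of the task, too low and the intrinsic signal has no effect.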