The Next Step in Model-Based RL

Right now we're putting the PPO policy aside for a bit and focusing on model-based RL. The next step is just training models, generative models, in my test. I expect to see things like state tracking: if you see a tree, turn to the left, and then turn back, presumably there's some machinery inside the generative model that is tracking whether that tree is still there.

