TalkRL: The Reinforcement Learning Podcast cover image

Max Schwarzer

TalkRL: The Reinforcement Learning Podcast

00:00

How to Use Model Free Methods to Improve Performance

Model-free methods like BBF can get us this far with such little data. Is it because you captured everything so well in the value function that there's no point trying to plan on top of that? I think that's kind of it. You should think of a replay buffer as essentially a perfect non-parametric model of your environment.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app