TalkRL: The Reinforcement Learning Podcast cover image

Max Schwarzer

TalkRL: The Reinforcement Learning Podcast

00:00

The Importance of Decision Time Planning in Atari

A paper recently showed that just to examine this in Atari and found that you could make the wrong action quite a few times and it wouldn't actually change your value function at all. So yeah, from that perspective, I think it makes sense that in this style of control, planning is not critical. That doesn't mean that model learning couldn't be beneficial, but you don't necessarily have to design a planning heavy algorithm to get a good performance.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app