
Max Schwarzer
TalkRL: The Reinforcement Learning Podcast
00:00
The Importance of Decision Time Planning in Atari
A paper recently showed that just to examine this in Atari and found that you could make the wrong action quite a few times and it wouldn't actually change your value function at all. So yeah, from that perspective, I think it makes sense that in this style of control, planning is not critical. That doesn't mean that model learning couldn't be beneficial, but you don't necessarily have to design a planning heavy algorithm to get a good performance.
Transcript
Play full episode