The Importance of Decision Time Planning in Atari

A paper recently showed that just to examine this in Atari and found that you could make the wrong action quite a few times and it wouldn't actually change your value function at all. So yeah, from that perspective, I think it makes sense that in this style of control, planning is not critical. That doesn't mean that model learning couldn't be beneficial, but you don't necessarily have to design a planning heavy algorithm to get a good performance.

Play episode from 32:01

Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app