TalkRL: The Reinforcement Learning Podcast cover image

Jeff Clune

TalkRL: The Reinforcement Learning Podcast

CHAPTER

The Non-Causal Model of Predicting the Next State in a Stack of Frames

This is a special case of Jan LeCun's cake where the bulk of the work was this unsupervised or self supervised training, but you had this extra step here. You were able to use all this unlabeled data and I guess turn it into labeled data. The overall project took about a year and a half to get it all to actually work. But yeah, it wasn't like we set off with the goal to do something vaguely in this space and eventually through trial and there we got to here.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner