Danijar Hafner 2

TalkRL: The Reinforcement Learning Podcast

Dreamer: A MetaRL Agent

The model integrates information over time into Markovian states. We're actually offloading a lot of what's challenging about RL to the unsupervised model learning objective. And so we don't need rewards to learn which parts of the state are relevant. It can do some sort of meta learning. But yeah, this is almost just an emergent property of using sequence models in RL, which we should have been doing for a long time anyways.
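
To make "integrating information over time into Markovian states" concrete, here is a minimal sketch. It is a hand-written toy, not Dreamer's learned model: the `State` class, the `update` rule, and the constant-velocity setup are all assumptions chosen for illustration. Each observation shows only a position, which is not enough to predict the next one, but a state carried across steps is.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class State:
    """Hypothetical compact state: enough to predict the next observation."""
    position: float
    velocity: float


def update(state: Optional[State], observation: float) -> State:
    """Fold one new observation into the running state.

    A single observation only reveals position. Velocity is recovered by
    comparing it with the position stored in the previous state.
    """
    if state is None:
        # First step: no history yet, so assume zero velocity.
        return State(position=observation, velocity=0.0)
    return State(position=observation, velocity=observation - state.position)


def predict_next(state: State) -> float:
    """Predict the next observation from the state alone (no history needed)."""
    return state.position + state.velocity


# An object moving at constant speed, observed one position at a time.
observations = [0.0, 1.0, 2.0, 3.0]

state = None
for obs in observations:
    state = update(state, obs)

prediction = predict_next(state)
print(state, prediction)
```

In this toy, the update rule is fixed by hand and no reward signal is involved. In the setting described above, a sequence model would instead learn such an update from its unsupervised prediction objective.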
