
Danijar Hafner 2
TalkRL: The Reinforcement Learning Podcast
Dreamer: A Framework for Unsupervised Exploration
Director is the first algorithm that actually combines all these three aspects in at least one form. It learns a world model, it does unsupervised exploration, and it learns a goal condition policy. And empirically it works very well on sparse reward tasks. So I think there's a lot of promise there. We're starting to try it out on robots now. But the design space is much larger than that, right? There are a lot of details in how to implement these different components.
00:00
Transcript
Play full episode
Remember Everything You Learn from Podcasts
Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.