TalkRL: The Reinforcement Learning Podcast cover image

Danijar Hafner 2

TalkRL: The Reinforcement Learning Podcast

CHAPTER

Daydreamer: How to Train a Robot to Walk in One Hour

Daydreamer uses the dreamer v3 algorithm, actually a slightly earlier version of that algorithm. We trained on visual pick and place from sparse rewards with an arm that picks up balls and places them into a different bin. Train it to roll over and then figure out how to stand up and walk without any simulators in just one hour and there were no resets.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner