TalkRL: The Reinforcement Learning Podcast cover image

Danijar Hafner 2

TalkRL: The Reinforcement Learning Podcast

00:00

Daydreamer: How to Train a Robot to Walk in One Hour

Daydreamer uses the dreamer v3 algorithm, actually a slightly earlier version of that algorithm. We trained on visual pick and place from sparse rewards with an arm that picks up balls and places them into a different bin. Train it to roll over and then figure out how to stand up and walk without any simulators in just one hour and there were no resets.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app