Daydreamer: How to Train a Robot to Walk in One Hour

Daydreamer uses the DreamerV3 algorithm, or more precisely a slightly earlier version of it. We trained on visual pick-and-place from sparse rewards, with an arm that picks up balls and places them into a different bin. We also trained a robot to roll over and then figure out how to stand up and walk, without any simulators, in just one hour and with no resets.

Play episode from 20:41
