TalkRL: The Reinforcement Learning Podcast cover image

Danijar Hafner 2

TalkRL: The Reinforcement Learning Podcast

00:00

Dreamer 3 for Minecraft and Making Diamonds in Minecraft

Dreamer 3 can make diamonds in Minecraft, which is a very hard exploration problem. It took 17 days for our training runs to finish; you don't want to tune hyperparameters and fiddle with the algorithm at those time scales. We didn't really expect that it was possible just with the robust objective functions and the entropy-regularized policy objective we use in Dream Every 3.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app