TalkRL: The Reinforcement Learning Podcast cover image

Danijar Hafner 2

TalkRL: The Reinforcement Learning Podcast

00:00

Dreamer V3: A Future of Efficiency

In dreamer v3 because the algorithm is so robust we observed very predictable scaling behavior of the algorithm. So in a sense the world model lets you just trade off more compute to become more data efficient. And I think even today if we want to be ten times more data efficient we can actually do that by increasing the model size and increasing the gradient steps, while waiting longer for the whole thing to train.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app