TalkRL: The Reinforcement Learning Podcast cover image

Danijar Hafner 2

TalkRL: The Reinforcement Learning Podcast

CHAPTER

Dreamer V3: A Future of Efficiency

In dreamer v3 because the algorithm is so robust we observed very predictable scaling behavior of the algorithm. So in a sense the world model lets you just trade off more compute to become more data efficient. And I think even today if we want to be ten times more data efficient we can actually do that by increasing the model size and increasing the gradient steps, while waiting longer for the whole thing to train.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner