TalkRL: The Reinforcement Learning Podcast cover image

Danijar Hafner 2

TalkRL: The Reinforcement Learning Podcast

CHAPTER

Dreamer V2: A Novel Algorithm for Continuous Control

In dreamer v1 we focused on continuous control from pixels. And it was very data efficient and got high final performance. But also pretty narrow in the sense that we couldn't deal with the street actions very well. We weren't competitive on standard benchmarks like Atari. My goal for dreamer v3 was to automate all of that away. So you can have an algorithm where the objective functions are robust enough that you can just run it out of the box and it'll give you good performance.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner