TalkRL: The Reinforcement Learning Podcast cover image

Danijar Hafner 2

TalkRL: The Reinforcement Learning Podcast

00:00

Dreamer V2: A Novel Algorithm for Continuous Control

In dreamer v1 we focused on continuous control from pixels. And it was very data efficient and got high final performance. But also pretty narrow in the sense that we couldn't deal with the street actions very well. We weren't competitive on standard benchmarks like Atari. My goal for dreamer v3 was to automate all of that away. So you can have an algorithm where the objective functions are robust enough that you can just run it out of the box and it'll give you good performance.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app