Dreamer V2: A Novel Algorithm for Continuous Control

In dreamer v1 we focused on continuous control from pixels. And it was very data efficient and got high final performance. But also pretty narrow in the sense that we couldn't deal with the street actions very well. We weren't competitive on standard benchmarks like Atari. My goal for dreamer v3 was to automate all of that away. So you can have an algorithm where the objective functions are robust enough that you can just run it out of the box and it'll give you good performance.

Play episode from 07:57

Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app