The Variational Empowerment Paper

Unsupervised RL is like filling out supervised. Like you just need to learn which things are not reversible in your environment. I think that's a really good way of thinking about it. And one of the really cool things about this is that it's fully unsupervised. They actually have some work which does that. It was very clever idea because it was actually sent unsupervised as well.

Play episode from 56:14

Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app