
Episode 22: Archit Sharma, Stanford, on unsupervised and autonomous reinforcement learning
Generally Intelligent
00:00
The Variational Empowerment Paper
Unsupervised RL is like filling out supervised. Like you just need to learn which things are not reversible in your environment. I think that's a really good way of thinking about it. And one of the really cool things about this is that it's fully unsupervised. They actually have some work which does that. It was a very clever idea because it was actually sent unsupervised as well.