
Episode 22: Archit Sharma, Stanford, on unsupervised and autonomous reinforcement learning
Generally Intelligent
How Robust Is Quail?
Quail was trained on a variety of tasks. It can still take several thousand steps to actually solve the task. What we need to add here to make it go even faster is an interesting question. We have only just started to scratch the surface of this.