
Episode 22: Archit Sharma, Stanford, on unsupervised and autonomous reinforcement learning
Generally Intelligent
00:00
Are There Any Mistakes That You've Made as a Researcher?
Top down organizations can be more productive than decentralized ones, according to the researcher. He says he's improved on not getting too attached to ideas in his research. The scriminator augmented model based RL was a good idea but didn't work out well for him and other researchers.
Transcript
Play full episode