
Episode 22: Archit Sharma, Stanford, on unsupervised and autonomous reinforcement learning
Generally Intelligent
Are There Any Mistakes That You've Made as a Researcher?
Top down organizations can be more productive than decentralized ones, according to the researcher. He says he's improved on not getting too attached to ideas in his research. The scriminator augmented model based RL was a good idea but didn't work out well for him and other researchers.
00:00
Transcript
Play full episode
Remember Everything You Learn from Podcasts
Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.