
Episode 22: Archit Sharma, Stanford, on unsupervised and autonomous reinforcement learning
Generally Intelligent
Learning a Gradient Signal in a Neural Network
The idea here was to learn an gradient signal across discrete latent variables. The whole pair was that it might introduce a bit of bias because any function of approximation is biased, but it might cut down on the variance. It ended up being the case that the bias is actually too high. Even that learn faster in the beginning ended up like the some optimal performance.
00:00
Transcript
Play full episode
Remember Everything You Learn from Podcasts
Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.