The Relationship Between Distributional Enforcement Learning and G Flow Nets

distributional RL is surprisingly different from classic reinforcement learning approaches without getting too deep into technical weeds. With G flow net the relationship is more symmetric in the sense that they're attempting to make two distributions match and yes there's a relationship there's an edge between these two distributions but it's less temporal if that makes sense. I see another aspect of all this that I wanted to dig into a bit is I think as it appears everywhere in deep learning these days there was a pretty clear theory practice gap that you were encountering especially in some of the initial follow-up papers after C51 algorithm for instance where you're like okay these things seem to work really well but why?

Play episode from 42:06

Transcript

Episode notes

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app