The Gradient: Perspectives on AI cover image

Marc Bellemare: Distributional Reinforcement Learning

The Gradient: Perspectives on AI

00:00

The Space of Value Functions Is Highly Structured

The space of value functions itself was highly structured. You mentioned it's a polytope but with some unintuitive characteristics. Could you comment on that at all? Yeah so basically just to give some background on this. When in reinforcement learning we'll have the reward function that is sort of defining the objective. And then we'll have different ways of behaving. So each of these is a vector so it's a set of n dimensional vectors we're in some number of states.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app