The Space of Value Functions Is Highly Structured

The space of value functions itself was highly structured. You mentioned it's a polytope but with some unintuitive characteristics. Could you comment on that at all? Yeah so basically just to give some background on this. When in reinforcement learning we'll have the reward function that is sort of defining the objective. And then we'll have different ways of behaving. So each of these is a vector so it's a set of n dimensional vectors we're in some number of states.

Play episode from 54:40

Transcript

Episode notes

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app