AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
The Intuition of Algorithms in Markov Decisions
Markov decisions are id, so you can't really enforce that. There's a more interesting case, which is the case of non markovan. So, for example, alice might say, i prefer that bob always go in the same direction. And with this kind of linear programme, either we find a solution, or if the linear programme does not have a solution, no markovian reward function is actually consistent with the original preferences that were expressed.