On the Expressivity of Markov Reward

Book •

Author

David Abel

This research explores the expressivity of Markov reward functions in capturing various tasks within reinforcement learning.

It introduces three task types—sets of acceptable behaviors, partial orderings over behaviors, and partial orderings over trajectories—and demonstrates that while Markov rewards can express many tasks, there are instances where they cannot.

The study also provides algorithms to determine if a task can be captured by a Markov reward function and to construct such a function when possible.

Mentioned by

Doina Precup

Mentioned in 0 episodes

Mentioned by

Doina Precup

when discussing her research on reward specification for RL agents.

Hierarchical and Continual RL with Doina Precup - #567

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app