AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
How to Characterize Machine Learning in a Machine Learning Environment?
In the paper, we thought of essentially a framework where you might imagine somebody who has a task in mind. And then you have a learner, called a learner bob. And so alice has a task specification in mind, and now has to translate this into a reward signal that bob would get. So for example, alice might have a preference over certain ways of behaving. But et basically says you can't always find a marcovian rewardmarcovian in the state of the agent.