Is the Tesla Car a Shared Reward?

In some environments that is the case. But if you think about the human world, we're not actually all jointly optimizing for the same shared reward function. We all have different goals. It's the same in autonomous driving: cars have some shared goals, but not every goal is shared. So I think it's not always plausible to assume that reward is totally shared. If reward weren't shared, then how would you be able to figure out how it would affect their value function? Your assumption that you could use your own model to put the other agent in your place might not make sense, because you're not sure what their goals are.

Play episode from 29:18
