Natasha Jaques

TalkRL: The Reinforcement Learning Podcast

Is the Tesla Car a Shared Reward?

In some environments that is the case. But if you think about the human world, we're not actually all jointly optimizing for the same shared reward function. We all have different goals, and in the same sense, in autonomous driving, cars have some shared goals. So I think it's not always plausible to assume that reward is totally shared. If reward weren't shared, then how would you be able to figure out how it would affect their value function? Your assumption that you could use your own model to put the other agent in your place might not make sense, because you're not sure what their goals are.
