TalkRL: The Reinforcement Learning Podcast cover image

Natasha Jaques

TalkRL: The Reinforcement Learning Podcast

CHAPTER

Compasive AI

The communication protocol that emerged as a result of the influence reward in these environments emerged precisely because they're a little bit more complex than a prisoner's dilemma. So I do think the demandingness of the environment definitely shapes the solution that you end up coming up with. And so while simplified versions are good for really concretely narrowing down aspects of the problem, it's not the only thing we should study. It almost seems like multi Asian RL might help us get down to the essence of some of these issues without the complexities of language and culture which is always part of social experiment approaches. This somehow helps us distill down to some mathematical essence of the problem. Yeah, and I think that's

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner