TalkRL: The Reinforcement Learning Podcast cover image

Natasha Jaques

TalkRL: The Reinforcement Learning Podcast

00:00

Compasive AI

The communication protocol that emerged as a result of the influence reward in these environments emerged precisely because they're a little bit more complex than a prisoner's dilemma. So I do think the demandingness of the environment definitely shapes the solution that you end up coming up with. And so while simplified versions are good for really concretely narrowing down aspects of the problem, it's not the only thing we should study. It almost seems like multi Asian RL might help us get down to the essence of some of these issues without the complexities of language and culture which is always part of social experiment approaches. This somehow helps us distill down to some mathematical essence of the problem. Yeah, and I think that's

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app