TalkRL: The Reinforcement Learning Podcast cover image

Rohin Shah

TalkRL: The Reinforcement Learning Podcast

00:00

Inter Active Reward Learning

Inversoro paradime assumes that we were not uncertain in the end. So there can be uncertainty, but like the human isn't in the environment. The inter active reward learning setting is mostly just a thing we made up because it was a thing that people often brought up as like, maybe this will work. We wanted to talk about it and show why it doesn't, doesn't, in fact works.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app