TalkRL: The Reinforcement Learning Podcast cover image

Rohin Shah

TalkRL: The Reinforcement Learning Podcast

00:00

Learning From Human Feedback

Machine learning could be used to promote content that humans would predict would improve the user's well being. The question is whether or not it is actually feasible currently. Algoons will become significantly cheaper one day, and then systems trained with human feetback won't make sense.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app