
Rohin Shah
TalkRL: The Reinforcement Learning Podcast
Learning From Human Feedback
Machine learning could be used to promote content that humans would predict would improve the user's well being. The question is whether or not it is actually feasible currently. Algoons will become significantly cheaper one day, and then systems trained with human feetback won't make sense.
00:00
Transcript
Play full episode
Remember Everything You Learn from Podcasts
Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.