
Scott Emmons
Expert in categorizing problems with Reinforcement Learning from Human Feedback
Best podcasts with Scott Emmons
Ranked by the Snipd community

5 snips
Jun 12, 2024 • 1h 41min
33 - RLHF Problems with Scott Emmons
Expert Scott Emmons discusses challenges in Reinforcement Learning from Human Feedback (RLHF): deceptive inflation, overjustification, bounded human rationality, and solutions. Touches on dimensional analysis and his research program, emphasizing the importance of addressing these challenges in AI systems.