

RLC 2024 - Posters and Hallways 4
Sep 19, 2024
David Abel from DeepMind dives into the 'Three Dogmas of Reinforcement Learning,' offering fresh insights on foundational principles. Kevin Wang from Brown discusses innovative variable depth search methods for Monte Carlo Tree Search, enhancing efficiency. Ashwin Kumar from Washington University addresses fairness in resource allocation, highlighting ethical implications. Finally, Prabhat Nagarajan from UAlberta delves into Value overestimation, revealing its impact on decision-making in RL. This dynamic conversation touches on pivotal advancements and challenges in the field.
AI Snips
Chapters
Transcript
Episode notes
Three Dogmas of RL
- The paper "Three Dogmas of Reinforcement Learning" challenges the current RL paradigm.
- It aims to spark new research directions.
Andy's Keynote Impact
- David Abel recounted Andy's keynote, highlighting the emotional impact of the stories shared.
- The standing ovation reflected deep community appreciation for Andy's contributions.
Learning Compute Usage
- Kevin Wang's research allows tree search algorithms to learn their own compute usage.
- This addresses the common concern of excessive compute costs in algorithms like AlphaZero.