
 The Gradient: Perspectives on AI
 The Gradient: Perspectives on AI Marc Bellemare: Distributional Reinforcement Learning
 13 snips 
 Dec 8, 2022  Chapters 
 Transcript 
 Episode notes 
 1  2  3  4  5  6  7  8  9  10  11  12  13  14  15  16  17  18  19  20  21  22  23 
 Introduction 
 00:00 • 2min 
 How Did You Get Into AI? 
 02:22 • 3min 
 Is Machine Learning a New Frontier in Machine Learning? 
 05:29 • 2min 
 Is There a Cross Pollination of Ideas in Deep Learning? 
 07:17 • 2min 
 The Story of the Arcade Learning Environment 
 09:14 • 5min 
 Atari Games - The Real Killer With the Atari Learning Environment 
 13:45 • 4min 
 The A.A.L.E. 
 18:13 • 3min 
 The Avalon Benchmark 
 21:32 • 4min 
 Avalon, Is There a Future in Retort Learning? 
 25:19 • 2min 
 Is Reward Enough? 
 27:01 • 3min 
 Is There a Connection Between Reactor Learning and Efficiency? 
 30:05 • 4min 
 How Do We Learn Distributions of Reward? 
 33:50 • 5min 
 How Do You Estimate a Distribution? 
 38:37 • 3min 
 The Relationship Between Distributional Enforcement Learning and G Flow Nets 
 42:06 • 4min 
 Using Probability Metrics in Distributional Resonance Learning (RL) 
 45:41 • 3min 
 The Early Thought Process in the Vashas Time Distance 
 48:12 • 3min 
 Is Distributional RL a Good Way to Train an RL Agent? 
 51:03 • 4min 
 The Space of Value Functions Is Highly Structured 
 54:40 • 4min 
 Is There a Connection Between Exploration and Curiosity and Developing Good Representations of Them? 
 58:12 • 4min 
 Deep Reinforcement Learning in the Real World 
 01:01:58 • 2min 
 Distribution RL 
 01:04:17 • 2min 
 Applying RL to Controlling Physical Systems 
 01:06:26 • 2min 
 Is There a Key to Research Success? 
 01:08:35 • 4min 
