The Gradient: Perspectives on AI

Marc Bellemare: Distributional Reinforcement Learning

13 snips
Dec 8, 2022
Ask episode
Chapters
Transcript
Episode notes
1
Introduction
00:00 • 2min
2
How Did You Get Into AI?
02:22 • 3min
3
Is Machine Learning a New Frontier in Machine Learning?
05:29 • 2min
4
Is There a Cross Pollination of Ideas in Deep Learning?
07:17 • 2min
5
The Story of the Arcade Learning Environment
09:14 • 5min
6
Atari Games - The Real Killer With the Atari Learning Environment
13:45 • 4min
7
The A.A.L.E.
18:13 • 3min
8
The Avalon Benchmark
21:32 • 4min
9
Avalon, Is There a Future in Retort Learning?
25:19 • 2min
10
Is Reward Enough?
27:01 • 3min
11
Is There a Connection Between Reactor Learning and Efficiency?
30:05 • 4min
12
How Do We Learn Distributions of Reward?
33:50 • 5min
13
How Do You Estimate a Distribution?
38:37 • 3min
14
The Relationship Between Distributional Enforcement Learning and G Flow Nets
42:06 • 4min
15
Using Probability Metrics in Distributional Resonance Learning (RL)
45:41 • 3min
16
The Early Thought Process in the Vashas Time Distance
48:12 • 3min
17
Is Distributional RL a Good Way to Train an RL Agent?
51:03 • 4min
18
The Space of Value Functions Is Highly Structured
54:40 • 4min
19
Is There a Connection Between Exploration and Curiosity and Developing Good Representations of Them?
58:12 • 4min
20
Deep Reinforcement Learning in the Real World
01:01:58 • 2min
21
Distribution RL
01:04:17 • 2min
22
Applying RL to Controlling Physical Systems
01:06:26 • 2min
23
Is There a Key to Research Success?
01:08:35 • 4min