

Episode 19: Minqi Jiang, UCL, on environment and curriculum design for general RL agents
Jul 19, 2022
Chapters
Introduction
00:00 • 2min
How Did Your Research Interests Develop Over Time?
01:33 • 5min
The Overton Window of RL Research
06:19 • 2min
Using Self Play in Multi-Agent Learning
07:55 • 3min
Random Network Distillation - Is There a Reward?
11:18 • 3min
The Learning Potential Score for Revisions of Levels
14:22 • 4min
The UCB Bandit and the Generalization Gap?
18:33 • 2min
Game Theory
20:42 • 2min
Refutation Based Learning Environments
22:15 • 2min
PAIRED Is Optimizing for Minimax Regret
24:22 • 5min
How to Reduce the Reward Expenditure
28:59 • 2min
How to Maximize Reputation
30:57 • 3min
How to Find the Most Challenging Levels in the Environment
33:54 • 2min
Is There a Danger to This?
35:29 • 3min
Quality Diversity
38:04 • 3min
How to Optimize Towards a Non-Stationary Objective?
41:27 • 5min
Using a Dreamer Model Is a Good Idea, Right?
46:14 • 2min
Is There a Way to Clone a POET?
47:49 • 2min
Do Hard Behaviours Have a Higher Error Rate?
50:07 • 3min
Is There a Difference Between Discovery and Copying Behavior?
52:37 • 2min
Language to Second Modality Grounding
54:44 • 2min
Learning to Communicate With Machines
56:22 • 3min
Is There a Risk of Overfitting?
59:21 • 2min
Is There a Zero-Shot Prompting Pattern in Multi-Modal GPT-3?
01:01:12 • 3min
Getting Used to a Different Day Night Cycle of Life, Right?
01:03:49 • 3min
Online Supervised Learning
01:06:24 • 2min
Open Handling
01:08:04 • 4min
Open-Ended Learning
01:12:28 • 2min
Is There a Way to Speed Up Evolution?
01:14:54 • 3min
Are You Interested in Deep Learning Architectures?
01:18:17 • 2min
Is There a Lot of Innovation in RL?
01:19:51 • 2min
Do You Have Any Controversial Opinions?
01:22:20 • 2min
I Think the Rate of Publication Is Way Too High in the Field of Machine Learning
01:24:05 • 3min
Is There a Difference Between the Publication Cycle and the Productivity?
01:26:40 • 3min
The Difference Between Dreamer Versus Dreamer?
01:29:31 • 4min
What Makes a Good Collaboration?
01:33:06 • 2min
Is There Any Way to Improve the Performance of Your Agent?
01:34:43 • 2min
Adaptive Curricula
01:36:54 • 3min
Active Learning as a Curriculum?
01:39:56 • 2min
How Did You Get Your Ph.D. in Machine Learning?
01:41:48 • 5min
The Future of Machine Learning and Machine Learning
01:46:34 • 2min
Unpublished Research Failures
01:48:25 • 2min
Is There Something Big Missing?
01:50:55 • 4min