Chapters
Introduction
00:00 • 5min
The Challenges and Limitations of RLHF
05:02 • 3min
Is This Data Set Reusable?
08:28 • 2min
Optimize for Reward, but Maximize the Reward Function
10:11 • 2min
The Reward Model Isn't Perfect, Right?
11:50 • 2min
Token Level Probabilities
13:39 • 2min
Recursive Reward Models - Are We Going to Need That Soon?
15:18 • 2min
Are You Up for Talking About AGI?
17:23 • 2min
AI
19:43 • 3min
ChatGPT - Robotics Is Super Hard
22:23 • 3min
Do You Really Know What You're Doing?
25:21 • 3min
Is There a Language Model That Can Understand Language?
28:10 • 2min
Is It a Good Idea to Go Back to Academia?
30:28 • 4min
Do You Have a Clear Idea of AI?
34:09 • 3min
Social Learning Versus Imitative Learning
36:50 • 4min
Adaptive Online Generalization for Self-Driving Cars
40:59 • 2min
Is There a Distractor in Deep Learning?
43:02 • 3min