

17 - Training for Very High Reliability with Daniel Ziegler
Aug 21, 2022
Chapters
Transcript
Episode notes
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26
Introduction
00:00 • 2min
Is a G I Allinment Hard?
02:00 • 2min
Scalable Oversight - Is Scalability a Good Idea?
04:20 • 2min
Scalable Oversight
06:03 • 2min
The Contribution of Adversaa Training
07:59 • 2min
Are We Losing Performance Competitiveness by Putting Catastrophic Failures Here?
09:58 • 2min
Is the Catastrophe Measure Better Than the Quality Measure?
11:45 • 3min
How Do You Think About These Two Metrics?
14:20 • 3min
Is There a Metric to Capturing a Failure?
17:29 • 2min
Is the Generator Trying to Make Things Any Worse?
19:26 • 2min
How Do We Get More Out of Our Time?
21:17 • 3min
Is It a Fible Rout, or a R for N L P Attack?
24:14 • 2min
Is There a Gradient Barrier to Learning?
26:20 • 2min
Is There a Problem With Token Substitution?
28:21 • 4min
Is There Anything You Didn't Try That Worked?
32:27 • 2min
What Does Quality Mean?
34:31 • 2min
Do You Think It Makes a Difference?
36:33 • 2min
Is It a Fanfic or Something?
38:18 • 2min
The Effects of a Random Mistake on the Quality of the Language Model
39:54 • 2min
Is There a Violence in Alice in Wonderland?
41:25 • 3min
Rejection Sampled Snippets - Is This the Correct Estimator?
44:09 • 5min
What Is Redwood Research?
49:29 • 2min
Are You Less Excited About Deconfusion Research?
51:41 • 2min
Is There Any Research on Scalable Oversight?
53:54 • 2min
Is There a Relationship Between the Interpretability Team and the Adicator?
56:20 • 2min
The Carreer Acproaching at 80 Thousand Hours
58:18 • 3min