AXRP - the AI X-risk Research Podcast

6 - Debate and Imitative Generalization with Beth Barnes

Apr 8, 2021
Ask episode
Chapters
Transcript
Episode notes
1
Introduction
00:00 • 3min
2
It's Like, a Tree of Answers
03:00 • 4min
3
How to Train a Machine Learning Model for Debate?
07:19 • 2min
4
Train and Model to Imitate Human Judgments
09:33 • 2min
5
Are There Any Problems in Like, a Safety or an Alignment That This Would Solve?
11:25 • 2min
6
Is It Necessary?
12:55 • 3min
7
Are You Doing This Bad Thing?
16:00 • 3min
8
Is Debate Easier Than Other Things?
18:53 • 2min
9
The Tree of Arguments in a Debate?
20:41 • 5min
10
Is There a Problem With Debate?
25:42 • 3min
11
Is There a Difference Between Free Response and Debate?
28:30 • 3min
12
Is There a Problem With Debate Rit?
31:11 • 2min
13
Is There a Penalty for Choosing What Recurses On?
33:38 • 2min
14
Is There a Mechanism for a Good Debate?
35:20 • 2min
15
What Is Cross Examining?
37:29 • 3min
16
How Do You Come Up With That Strategy?
40:45 • 2min
17
Complexity Clauses
42:26 • 5min
18
The Obfiscated Arguments Problem
47:06 • 2min
19
Is It Possible to Generate Reasonable Arguments in a Debate?
49:31 • 3min
20
Is the Research Strategy Like, Like, Is It Relevant to Humans?
52:43 • 3min
21
Is There a Difference Between Human Debate and Mal Debate?
55:21 • 4min
22
Are You Trying to Be Honest?
58:53 • 3min
23
The King of France Does Not Have Hair
01:01:55 • 4min
24
Are You Going to Judge Debates?
01:05:39 • 2min
25
Answering a Question Like, Is Daniel an Australian?
01:07:18 • 2min
26
Is There a Better Answer to That Question?
01:09:19 • 4min
27
Is It a Heroistic Approach or a Cold Generalization?
01:12:54 • 4min
28
How to Train a Debater
01:17:02 • 2min
29
Is This a Lak?
01:19:05 • 2min
30
Machine Learning
01:20:57 • 5min
31
How to Label Dogs in a Train Set
01:25:53 • 2min
32
Cansev Generalise From the Human Labels?
01:27:37 • 2min
33
Do You Think We're Going to Use Imitative Generalization?
01:29:44 • 2min
34
How to Interpret a Big Nural Net in a Test Set?
01:31:24 • 5min
35
The Importance of More Interpretability
01:36:06 • 4min
36
Can We Sort of Figure Out Which Bits Are the World Model and Which Are the Sent?
01:39:48 • 5min
37
The Forertion of Debate - Is There a Floor?
01:44:43 • 1min
38
Is Debate Really a Good Idea to Train Ema Systems?
01:46:04 • 3min
39
How to Train Houses Old Attractively Train an Ame System
01:48:48 • 2min
40
Is There a Difference Between Deliberation and Distillation?
01:50:25 • 2min
41
Ida, Is It a Good Plan?
01:52:49 • 3min
42
What's the Bad Thing About Debate?
01:55:26 • 3min