AXRP - the AI X-risk Research Podcast

4 - Risks from Learned Optimization with Evan Hubinger

Feb 17, 2021
Ask episode
Chapters
Transcript
Episode notes
1
Introduction
00:00 • 2min
2
What's the Difference Between Mesa and Meta in Machine Learning?
01:41 • 2min
3
Is Optimization a Mechanistic Process?
04:06 • 3min
4
Neural Networks
07:16 • 6min
5
Will It Generalize Properly to This New Datapoint?
12:56 • 4min
6
Generalization Bounds Aren't Enough, You Know?
17:06 • 2min
7
The Deceptive Linear Training Set Up No Longer Works
19:20 • 4min
8
Admosperal Training
23:32 • 6min
9
What Is a Messimess Optimiser?
29:31 • 3min
10
Openin Five Agents - Is There a Coherent Objective?
32:10 • 2min
11
Is There a Behaviour Objective?
33:49 • 3min
12
Is There a Difference Between a Model's Mas Objective and a Behavior Objective?
36:26 • 3min
13
Is There an Open Research Problem in Mecha Optimisation?
39:53 • 3min
14
Do You Expect Optimization to Happen at Training Time?
42:50 • 5min
15
How Simple Are Optimisations in Machine Learning?
48:04 • 6min
16
Is It a Speed Prior?
53:50 • 2min
17
Is There a Difference Between Time and Space?
55:59 • 3min
18
Masoptimizer
59:25 • 2min
19
Do You Know How to Train Look Up Tables?
01:01:22 • 4min
20
Is There a Way to Get a Masoprex?
01:05:15 • 5min
21
Do You Want to Avoid Mesooptimizers?
01:10:05 • 4min
22
Machine Learning and Machine Learning in the Machine Learning Community
01:14:24 • 2min
23
Is There a Way to Decompose a Module?
01:16:17 • 4min
24
Machine Learning and Inner Alignment
01:20:21 • 5min
25
Sub Optomal at Being Deceptive?
01:25:11 • 2min
26
Proxy Linement Failures
01:26:50 • 4min
27
Is Sttubing My Toe Bad?
01:31:08 • 2min
28
What You Really Care About Is, Like, the Alil Frequency of A.
01:33:11 • 2min
29
Do You Want to Train a System for a Diverse Set of Tasks?
01:34:51 • 3min
30
Is It a Trade Worth Making?
01:38:17 • 6min
31
Is There an Inner Alignment Problem?
01:44:04 • 2min
32
Do You Get Deceptive Alignment?
01:46:03 • 4min
33
Training on a Huge Corpus of Data
01:50:32 • 2min
34
How Many Martin Lutherthers Are There?
01:52:57 • 2min
35
How Do You Get a Situation Where Your System Cares About Multile Episodes?
01:54:59 • 5min
36
Nonmaapia Problems in on Line Learning
01:59:37 • 3min
37
How Confident Are You That the Arguments in the Paper Are Correct?
02:02:33 • 2min
38
The Most Dangerous Outcomes From Ai
02:04:48 • 3min
39
What Do We Do About This?
02:07:43 • 2min
40
How to Solve a Problem Before the Observation
02:10:02 • 2min
41
How to Follow Your Work?
02:11:32 • 2min