

4 - Risks from Learned Optimization with Evan Hubinger
Feb 17, 2021
Chapters
Introduction
00:00 • 2min
What's the Difference Between Mesa and Meta in Machine Learning?
01:41 • 2min
Is Optimization a Mechanistic Process?
04:06 • 3min
Neural Networks
07:16 • 6min
Will It Generalize Properly to This New Datapoint?
12:56 • 4min
Generalization Bounds Aren't Enough, You Know?
17:06 • 2min
The Deceptive Linear Training Set Up No Longer Works
19:20 • 4min
Adversarial Training
23:32 • 6min
What Is a Mesa-Optimizer?
29:31 • 3min
OpenAI Five Agents - Is There a Coherent Objective?
32:10 • 2min
Is There a Behavioral Objective?
33:49 • 3min
Is There a Difference Between a Model's Mesa-Objective and a Behavioral Objective?
36:26 • 3min
Is There an Open Research Problem in Mesa-Optimization?
39:53 • 3min
Do You Expect Optimization to Happen at Training Time?
42:50 • 5min
How Simple Are Optimisations in Machine Learning?
48:04 • 6min
Is It a Speed Prior?
53:50 • 2min
Is There a Difference Between Time and Space?
55:59 • 3min
Mesa-Optimizer
59:25 • 2min
Do You Know How to Train Lookup Tables?
01:01:22 • 4min
Is There a Way to Get a Mesa-Optimizer?
01:05:15 • 5min
Do You Want to Avoid Mesa-Optimizers?
01:10:05 • 4min
Machine Learning and Machine Learning in the Machine Learning Community
01:14:24 • 2min
Is There a Way to Decompose a Module?
01:16:17 • 4min
Machine Learning and Inner Alignment
01:20:21 • 5min
Suboptimal at Being Deceptive?
01:25:11 • 2min
Proxy Alignment Failures
01:26:50 • 4min
Is Stubbing My Toe Bad?
01:31:08 • 2min
What You Really Care About Is, Like, the Allele Frequency of A.
01:33:11 • 2min
Do You Want to Train a System for a Diverse Set of Tasks?
01:34:51 • 3min
Is It a Trade Worth Making?
01:38:17 • 6min
Is There an Inner Alignment Problem?
01:44:04 • 2min
Do You Get Deceptive Alignment?
01:46:03 • 4min
Training on a Huge Corpus of Data
01:50:32 • 2min
How Many Martin Luthers Are There?
01:52:57 • 2min
How Do You Get a Situation Where Your System Cares About Multiple Episodes?
01:54:59 • 5min
Non-Myopia Problems in Online Learning
01:59:37 • 3min
How Confident Are You That the Arguments in the Paper Are Correct?
02:02:33 • 2min
The Most Dangerous Outcomes From AI
02:04:48 • 3min
What Do We Do About This?
02:07:43 • 2min
How to Solve a Problem Before the Observation
02:10:02 • 2min
How to Follow Your Work?
02:11:32 • 2min