4 - Risks from Learned Optimization with Evan Hubinger

1

Introduction

00:00 • 2min

2

What's the Difference Between Mesa and Meta in Machine Learning?

01:41 • 2min

3

Is Optimization a Mechanistic Process?

04:06 • 3min

4

Neural Networks

07:16 • 6min

5

Will It Generalize Properly to This New Datapoint?

12:56 • 4min

6

Generalization Bounds Aren't Enough, You Know?

17:06 • 2min

7

The Deceptive Linear Training Set Up No Longer Works

19:20 • 4min

8

Admosperal Training

23:32 • 6min

9

What Is a Messimess Optimiser?

29:31 • 3min

10

Openin Five Agents - Is There a Coherent Objective?

32:10 • 2min

11

Is There a Behaviour Objective?

33:49 • 3min

12

Is There a Difference Between a Model's Mas Objective and a Behavior Objective?

36:26 • 3min

13

Is There an Open Research Problem in Mecha Optimisation?

39:53 • 3min

14

Do You Expect Optimization to Happen at Training Time?

42:50 • 5min

15

How Simple Are Optimisations in Machine Learning?

48:04 • 6min

16

Is It a Speed Prior?

53:50 • 2min

17

Is There a Difference Between Time and Space?

55:59 • 3min

18

Masoptimizer

59:25 • 2min

19

Do You Know How to Train Look Up Tables?

01:01:22 • 4min

20

Is There a Way to Get a Masoprex?

01:05:15 • 5min

21

Do You Want to Avoid Mesooptimizers?

01:10:05 • 4min

22

Machine Learning and Machine Learning in the Machine Learning Community

01:14:24 • 2min

23

Is There a Way to Decompose a Module?

01:16:17 • 4min

24

Machine Learning and Inner Alignment

01:20:21 • 5min

25

Sub Optomal at Being Deceptive?

01:25:11 • 2min

26

Proxy Linement Failures

01:26:50 • 4min

27

Is Sttubing My Toe Bad?

01:31:08 • 2min

28

What You Really Care About Is, Like, the Alil Frequency of A.

01:33:11 • 2min

29

Do You Want to Train a System for a Diverse Set of Tasks?

01:34:51 • 3min

30

Is It a Trade Worth Making?

01:38:17 • 6min

31

Is There an Inner Alignment Problem?

01:44:04 • 2min

32

Do You Get Deceptive Alignment?

01:46:03 • 4min

33

Training on a Huge Corpus of Data

01:50:32 • 2min

34

How Many Martin Lutherthers Are There?

01:52:57 • 2min

35

How Do You Get a Situation Where Your System Cares About Multile Episodes?

01:54:59 • 5min

36

Nonmaapia Problems in on Line Learning

01:59:37 • 3min

37

How Confident Are You That the Arguments in the Paper Are Correct?

02:02:33 • 2min

38

The Most Dangerous Outcomes From Ai

02:04:48 • 3min

39

What Do We Do About This?

02:07:43 • 2min

40

How to Solve a Problem Before the Observation

02:10:02 • 2min

41

How to Follow Your Work?

02:11:32 • 2min