8 - Assistance Games with Dylan Hadfield-Menell

1

Introduction

00:00 • 2min

2

The Importance of Assistance Games in Artificial Intelligence

02:10 • 2min

3

Is There a Benefit to Assist Games in Reducing Existential Threats?

04:27 • 3min

4

Is Assistance Games a Good Way to Reduce Risk in Ai Systems?

07:26 • 2min

5

How to Reduce Exidential Risk With Assistance Games?

09:48 • 4min

6

Implementing Solutions to Assistance Games Could Provide Short Term Benefits

13:19 • 3min

7

Cooperative Inverse Reinforcement Learning - Ceril

16:02 • 2min

8

Inverse Reinforcement Learning - The Opposite of Planning

18:03 • 2min

9

Is Inverse Reinforcement Learning a Good Solution to Assistance Games?

19:47 • 5min

10

The Limits of Intentional Communication of Goals

24:43 • 3min

11

Is There a Limit on the Quantum of Information That You Can Consider?

27:16 • 3min

12

Learning the Specification of a Specific Task?

29:55 • 2min

13

CPS in Bicoin Mining

31:26 • 4min

14

Acerl Analysis - Is There a Prior?

35:21 • 4min

15

Identifying New Qualitative Features of Utility

39:18 • 2min

16

Aligning Recommender Systems With Human Values

41:07 • 3min

17

The Assistive Multi Armed Bandit

44:23 • 2min

18

Is Cooperative Inverse Reinforcement Learning a Game?

46:27 • 2min

19

Communication Equilibrium

48:38 • 5min

20

Is the Incoding Arbitrarily Complex?

53:38 • 2min

21

The Off Switch Game

55:28 • 2min

22

The Incentive for Oversight

57:55 • 2min

23

Off Switch Game Analysis

59:54 • 2min

24

I Safety and Concerns About Accidential Risk

01:02:14 • 2min

25

How Does the Uncertainty Over Human Reward Function Resolve?

01:03:52 • 3min

26

Do You Have to Guess Too Smart or Too Dumb?

01:06:44 • 2min

27

Is It Possible to Solve a Co-Operative Iorel Game?

01:08:38 • 2min

28

Is It a Good Idea or Not to Implement a Purely Off Switch Solution?

01:10:16 • 6min

29

The Human Overseaser Is Irrational, Right?

01:15:53 • 2min

30

Is There a Causal Relationship Between Objective Information and Cognitive Information?

01:17:45 • 2min

31

The Relationship Between Corrigibility and Existential Risk

01:20:09 • 3min

32

Is There a Difference Between Goal Management and Goal Achievement?

01:23:11 • 5min

33

The Off Switch Game

01:27:55 • 3min

34

Cooperative Iorel and the Off Switch Game - Is Uncertainty Important?

01:31:20 • 3min

35

Inverse Reward Design Formalizes That Inference Problem

01:34:48 • 2min

36

How Predictable Is the Predictability of Artificial Intelligence?

01:37:04 • 4min

37

Inference

01:41:27 • 2min

38

Risk Overse Trajector Optimization for Utility Functions

01:43:05 • 3min

39

Risk of Versus Planning - Why Maximize Expected Utility?

01:46:10 • 4min

40

Risk Aversion

01:49:57 • 3min

41

The Goal Achievement Component of Intelligence Is Minimizing Expected Utility

01:53:23 • 2min

42

How to Interpret a Goal in a Development Environment?

01:55:20 • 2min

43

How Many Steps to Equilibrium?

01:57:28 • 4min

44

Side Effects Mitigation and Inverse Reward Design

02:01:26 • 4min

45

The Line of Work on Inverse Reward Design

02:05:31 • 2min

46

Co-Operative Iorel and Incomplete Contracting

02:07:29 • 2min

47

Is There a Future for Cooperative Irregularity?

02:09:14 • 5min

48

Using Qualitative Research and Analysis in Deep Learning?

02:14:39 • 4min

49

The Importance of Single Agent Value Alignment

02:18:43 • 5min