AXRP - the AI X-risk Research Podcast

8 - Assistance Games with Dylan Hadfield-Menell

Jun 8, 2021
Ask episode
Chapters
Transcript
Episode notes
1
Introduction
00:00 • 2min
2
The Importance of Assistance Games in Artificial Intelligence
02:10 • 2min
3
Is There a Benefit to Assist Games in Reducing Existential Threats?
04:27 • 3min
4
Is Assistance Games a Good Way to Reduce Risk in Ai Systems?
07:26 • 2min
5
How to Reduce Exidential Risk With Assistance Games?
09:48 • 4min
6
Implementing Solutions to Assistance Games Could Provide Short Term Benefits
13:19 • 3min
7
Cooperative Inverse Reinforcement Learning - Ceril
16:02 • 2min
8
Inverse Reinforcement Learning - The Opposite of Planning
18:03 • 2min
9
Is Inverse Reinforcement Learning a Good Solution to Assistance Games?
19:47 • 5min
10
The Limits of Intentional Communication of Goals
24:43 • 3min
11
Is There a Limit on the Quantum of Information That You Can Consider?
27:16 • 3min
12
Learning the Specification of a Specific Task?
29:55 • 2min
13
CPS in Bicoin Mining
31:26 • 4min
14
Acerl Analysis - Is There a Prior?
35:21 • 4min
15
Identifying New Qualitative Features of Utility
39:18 • 2min
16
Aligning Recommender Systems With Human Values
41:07 • 3min
17
The Assistive Multi Armed Bandit
44:23 • 2min
18
Is Cooperative Inverse Reinforcement Learning a Game?
46:27 • 2min
19
Communication Equilibrium
48:38 • 5min
20
Is the Incoding Arbitrarily Complex?
53:38 • 2min
21
The Off Switch Game
55:28 • 2min
22
The Incentive for Oversight
57:55 • 2min
23
Off Switch Game Analysis
59:54 • 2min
24
I Safety and Concerns About Accidential Risk
01:02:14 • 2min
25
How Does the Uncertainty Over Human Reward Function Resolve?
01:03:52 • 3min
26
Do You Have to Guess Too Smart or Too Dumb?
01:06:44 • 2min
27
Is It Possible to Solve a Co-Operative Iorel Game?
01:08:38 • 2min
28
Is It a Good Idea or Not to Implement a Purely Off Switch Solution?
01:10:16 • 6min
29
The Human Overseaser Is Irrational, Right?
01:15:53 • 2min
30
Is There a Causal Relationship Between Objective Information and Cognitive Information?
01:17:45 • 2min
31
The Relationship Between Corrigibility and Existential Risk
01:20:09 • 3min
32
Is There a Difference Between Goal Management and Goal Achievement?
01:23:11 • 5min
33
The Off Switch Game
01:27:55 • 3min
34
Cooperative Iorel and the Off Switch Game - Is Uncertainty Important?
01:31:20 • 3min
35
Inverse Reward Design Formalizes That Inference Problem
01:34:48 • 2min
36
How Predictable Is the Predictability of Artificial Intelligence?
01:37:04 • 4min
37
Inference
01:41:27 • 2min
38
Risk Overse Trajector Optimization for Utility Functions
01:43:05 • 3min
39
Risk of Versus Planning - Why Maximize Expected Utility?
01:46:10 • 4min
40
Risk Aversion
01:49:57 • 3min
41
The Goal Achievement Component of Intelligence Is Minimizing Expected Utility
01:53:23 • 2min
42
How to Interpret a Goal in a Development Environment?
01:55:20 • 2min
43
How Many Steps to Equilibrium?
01:57:28 • 4min
44
Side Effects Mitigation and Inverse Reward Design
02:01:26 • 4min
45
The Line of Work on Inverse Reward Design
02:05:31 • 2min
46
Co-Operative Iorel and Incomplete Contracting
02:07:29 • 2min
47
Is There a Future for Cooperative Irregularity?
02:09:14 • 5min
48
Using Qualitative Research and Analysis in Deep Learning?
02:14:39 • 4min
49
The Importance of Single Agent Value Alignment
02:18:43 • 5min