Chapters
Transcript
Episode notes
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49
Introduction
00:00 • 2min
The Importance of Assistance Games in Artificial Intelligence
02:10 • 2min
Is There a Benefit to Assist Games in Reducing Existential Threats?
04:27 • 3min
Is Assistance Games a Good Way to Reduce Risk in Ai Systems?
07:26 • 2min
How to Reduce Exidential Risk With Assistance Games?
09:48 • 4min
Implementing Solutions to Assistance Games Could Provide Short Term Benefits
13:19 • 3min
Cooperative Inverse Reinforcement Learning - Ceril
16:02 • 2min
Inverse Reinforcement Learning - The Opposite of Planning
18:03 • 2min
Is Inverse Reinforcement Learning a Good Solution to Assistance Games?
19:47 • 5min
The Limits of Intentional Communication of Goals
24:43 • 3min
Is There a Limit on the Quantum of Information That You Can Consider?
27:16 • 3min
Learning the Specification of a Specific Task?
29:55 • 2min
CPS in Bicoin Mining
31:26 • 4min
Acerl Analysis - Is There a Prior?
35:21 • 4min
Identifying New Qualitative Features of Utility
39:18 • 2min
Aligning Recommender Systems With Human Values
41:07 • 3min
The Assistive Multi Armed Bandit
44:23 • 2min
Is Cooperative Inverse Reinforcement Learning a Game?
46:27 • 2min
Communication Equilibrium
48:38 • 5min
Is the Incoding Arbitrarily Complex?
53:38 • 2min
The Off Switch Game
55:28 • 2min
The Incentive for Oversight
57:55 • 2min
Off Switch Game Analysis
59:54 • 2min
I Safety and Concerns About Accidential Risk
01:02:14 • 2min
How Does the Uncertainty Over Human Reward Function Resolve?
01:03:52 • 3min
Do You Have to Guess Too Smart or Too Dumb?
01:06:44 • 2min
Is It Possible to Solve a Co-Operative Iorel Game?
01:08:38 • 2min
Is It a Good Idea or Not to Implement a Purely Off Switch Solution?
01:10:16 • 6min
The Human Overseaser Is Irrational, Right?
01:15:53 • 2min
Is There a Causal Relationship Between Objective Information and Cognitive Information?
01:17:45 • 2min
The Relationship Between Corrigibility and Existential Risk
01:20:09 • 3min
Is There a Difference Between Goal Management and Goal Achievement?
01:23:11 • 5min
The Off Switch Game
01:27:55 • 3min
Cooperative Iorel and the Off Switch Game - Is Uncertainty Important?
01:31:20 • 3min
Inverse Reward Design Formalizes That Inference Problem
01:34:48 • 2min
How Predictable Is the Predictability of Artificial Intelligence?
01:37:04 • 4min
Inference
01:41:27 • 2min
Risk Overse Trajector Optimization for Utility Functions
01:43:05 • 3min
Risk of Versus Planning - Why Maximize Expected Utility?
01:46:10 • 4min
Risk Aversion
01:49:57 • 3min
The Goal Achievement Component of Intelligence Is Minimizing Expected Utility
01:53:23 • 2min
How to Interpret a Goal in a Development Environment?
01:55:20 • 2min
How Many Steps to Equilibrium?
01:57:28 • 4min
Side Effects Mitigation and Inverse Reward Design
02:01:26 • 4min
The Line of Work on Inverse Reward Design
02:05:31 • 2min
Co-Operative Iorel and Incomplete Contracting
02:07:29 • 2min
Is There a Future for Cooperative Irregularity?
02:09:14 • 5min
Using Qualitative Research and Analysis in Deep Learning?
02:14:39 • 4min
The Importance of Single Agent Value Alignment
02:18:43 • 5min


