AXRP - the AI X-risk Research Podcast

11 - Attainable Utility and Power with Alex Turner

Sep 25, 2021
Ask episode
Chapters
Transcript
Episode notes
1
Introduction
00:00 • 2min
2
The Second Hope of Side Effect Research
02:24 • 2min
3
Is Objective Maximization a Bad Frame?
04:47 • 3min
4
The Top 3 Properties of a Prism
07:29 • 2min
5
How to Preserve the Ability to Do Random Things?
09:06 • 2min
6
Is There a Good Noap Policy?
11:25 • 2min
7
Is It Easier to Measure the Human Action Space?
13:02 • 5min
8
The Inaction Policy of Maximizing Power
18:05 • 2min
9
Power Maintenance vs Power Maximization
19:46 • 2min
10
The Difference Between Agent and Non Agent Parts of the Environment
21:19 • 4min
11
What's the Alternative to Roll Outs?
25:09 • 2min
12
How to Preserve the Utility of a Reward Function, Right?
27:07 • 2min
13
Aspects of the Utility Maximization Framework
29:01 • 2min
14
Getting the Right Objective and Loading the Objective Into the Agent
30:31 • 3min
15
The Relationship Between Corrigeibility and Utility Preservation
33:17 • 4min
16
Is It Useful to Have an Agent That Preserves Its Ability to Achieve a Wide Range of Things?
37:32 • 2min
17
The Relationship Between Power and Utility Preservation
39:15 • 3min
18
Is It the Case That Automa Policies Tend to Seek Power All the Time?
42:20 • 2min
19
What Kinds of Symmetries Do You Need for Instrumental Conversion?
44:37 • 5min
20
Is There a Game Over Screen?
49:32 • 1min
21
Is That How Instrumental Convergence Works?
51:01 • 1min
22
The Reward Function for an Ai System Isn't Optimistic
52:31 • 2min
23
How Does This Fit Into an Alignment?
54:05 • 3min
24
Ike, Is It Motivating the Ai Systems to Have Really Bad Consequences?
56:42 • 2min
25
The Chain of Arguments, Sir?
58:36 • 2min
26
Is Human Approval of an Action Based on a I Gaining Some Power?
01:00:10 • 3min
27
Is There a Free Lunch Theorem?
01:03:08 • 3min
28
Is There a Necessity in the Environment?
01:06:32 • 2min
29
Is There a Way to Maximize Your Utility?
01:08:54 • 3min
30
A Power Query - What Extensions Are Most Valuable to You?
01:11:37 • 4min
31
Is There a Bounded Optimality?
01:15:11 • 2min
32
Does Alex Turner Research Taste Look Like?
01:17:27 • 4min
33
What's the Alex Turner a Genda?
01:21:39 • 3min
34
What's Like a Steelman of the Case Gainst Power Seeking or Against?
01:24:46 • 3min