Chapters
Transcript
Episode notes
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47
Introduction
00:00 • 3min
The Problem With Mechanistic Anomalies
02:51 • 3min
The Importance of Rationality in AI
05:33 • 2min
The Future of AI Alignment
07:08 • 2min
How to Rule in Bad Behavior
09:29 • 5min
The in Distribution and Out of Distribution Anomalies
14:25 • 2min
The Importance of Mechanistic Anomalies in Training
16:43 • 2min
The Problem With Mechanistic Anomalies in AI
18:13 • 4min
The Importance of Rationality in AI
22:00 • 2min
The Importance of Mechanistic Anomaly Detection
23:59 • 2min
How to Analogize AI's Actions
26:05 • 2min
The Importance of Mechanistic Anomaly Detection
27:36 • 3min
The Mechanistic Anomaly of Diamonds
30:33 • 2min
The Importance of Predicting the Future
32:19 • 5min
The Diamond: A Metaphor for AI's Ability to Take Action
37:17 • 2min
The Problem With Mechanistic Anomaly Detection
38:58 • 3min
How to Drop Out Mechanisms in AI Training
42:13 • 4min
The Mechanism That Causes Both Sensors to Be On
45:48 • 2min
The Different Mechanisms That Explain Noise and Sensor One
47:59 • 4min
The Mechanisms of Manipulation of Noise
52:01 • 2min
How to Know if Your AI Wants to Make Things Happen
53:52 • 3min
How to Predict a Behavior
57:17 • 2min
AI's Potential to Improve Human Sensitivity
59:15 • 4min
How to Improve Sensor Readings With AI
01:02:53 • 2min
The Limits of Mechanistic Anomaly Detection
01:04:59 • 2min
Distribution vs. Out of Distribution Anomaly Detection
01:07:25 • 3min
The Importance of Interpretability in Mechanistic Anomalies
01:10:07 • 3min
Formalizing the Presumption of Independence
01:12:38 • 5min
Heuristic Arguments for Mechanistic Anomalies
01:17:47 • 2min
The Energy Argument in Physics
01:20:02 • 3min
The Role of Heuristics in Acoustical Arguments
01:22:58 • 2min
The Presumption of Independence in Stiff Simulations
01:24:52 • 2min
Heuristic Arguments Give Quote Unquote the Wrong Answer
01:27:17 • 4min
The Heuristic Argument for the N Equals Three Case
01:31:00 • 2min
How to Maximize the Entropy of Your Probability Distribution
01:32:49 • 4min
The Maximum Entropy Distribution of a Circuit
01:37:16 • 2min
The Inevitable Property of Maximum Entropy
01:39:03 • 3min
The Robustness of Heuristic Estimators
01:41:49 • 3min
The Impossibility of Being Adversarily Robust in AI
01:44:40 • 3min
The Heuristic Estimation of Quantity
01:48:10 • 2min
How to Deal With Adversarial Robustness in the Search Process
01:50:35 • 2min
Heuristic Estimates for Deficient Quantities
01:53:02 • 2min
Heuristic Arguments for Neural Nets
01:54:54 • 2min
How to Be a Good Heuristic Estimator
01:57:22 • 3min
How to Formalize Heuristic Arguments to Make Them Findable
02:00:15 • 2min
Redwood's Experimental Work on Mechanistic Anomalies
02:01:55 • 2min
The Importance of Probability in Research
02:03:28 • 2min