Chapters
Transcript
Episode notes
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23
Introduction
00:00 • 3min
How Much Training Did You Do to Train These Adversarial Policies?
02:35 • 2min
How to Train a Kicker to Block a Bull
04:13 • 2min
Rarbotic - A Multiagent Environment
06:05 • 2min
Observation Rate Increases With Self Placement Techniques
08:19 • 2min
Is It Like a Rule of Fum in a Training Setting?
10:44 • 2min
Is There a Difference Between the Input and the Output of a Robotic Environment?
13:07 • 2min
Is the Adversarial Victim Always the Kicker?
15:19 • 2min
What Happens When You Blindfold or Mask Victim Policies?
17:22 • 2min
Sumo Ant Sumo Wrestling
19:43 • 2min
How to Control a Victin Policy in Sumo?
21:22 • 2min
A, I Think We Didn't Choose a Particular Layer of a Network.
23:44 • 2min
Is the Space of Policies Transifret?
25:34 • 4min
Training Adversarial Policies
29:29 • 4min
A, You're Gonna Be Able to Discover a Blind Spot
33:44 • 2min
Kick and Defent
35:48 • 2min
Avas Writer, What's the Reception of the Work?
37:53 • 3min
Are There Any Misconceptions About Security?
40:32 • 2min
Is There a Preprint of This Paper?
42:33 • 4min
Is It Safe to Say That You're Interested in Ai Alignment?
46:06 • 2min
Is There a Problem in the Safety Community?
47:42 • 5min
Is There a Reliability of the Feedback Mechanism?
52:47 • 3min
Is There a Human Level Classification Accuracy on Natural Images?
55:30 • 3min