
Introduction to AI Control
AI Safety Fundamentals
00:00
Why Control Might Be Easier Than Alignment
The episode examines deception risks in neural networks and argues assessing capabilities can be simpler than inferring intentions.
Play episode from 00:57
Transcript


