AI Safety Fundamentals cover image

Introduction to AI Control

AI Safety Fundamentals

00:00

Why Control Might Be Easier Than Alignment

The episode examines deception risks in neural networks and argues assessing capabilities can be simpler than inferring intentions.

Play episode from 00:57
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app