Why Control Might Be Easier Than Alignment

The episode examines deception risks in neural networks and argues that assessing a model's capabilities can be simpler than inferring its intentions.


