AI Safety Fundamentals cover image

Introduction to AI Control

AI Safety Fundamentals

00:00

Untrusted Models Monitoring Each Other

Sarah discusses using multiple untrusted model instances to monitor one another and mitigation for collusion risks.

Play episode from 05:33
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app