Untrusted Models Monitoring Each Other

Sarah discusses using multiple untrusted model instances to monitor one another and mitigation for collusion risks.

Play episode from 05:33

Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!