Exploring the Untrusted Smart vs. Trusted Dumb Models Dilemma in Safety Research

This segment explores the challenges of using trusted models as analogies for human behavior in safety research, including cost, latency, and the risk that untrusted models evade countermeasures.

