The Nonlinear Library cover image

AF - Untrusted smart models and trusted dumb models by Buck Shlegeris

The Nonlinear Library

00:00

Exploring the Untrusted Smart vs. Trusted Dumb Models Dilemma in Safety Research

Exploring the challenges of using trusted models as analogies for human behavior in safety research, including the issues of cost, latency, and evasion of countermeasures by untrusted models.

Play episode from 06:10
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app