Ethan Perez–Inverse Scaling, Language Feedback, Red Teaming

The Inside View

Is the Paper Clip Maximizer Biased Toward Our Paper Clips?

Models can produce harmful distributions. How do we catch those? Is the basic idea here that the paper clip maximizer would be biased toward our paper clips? And so if we cannot find biases in our models right now, it would like to help us find biases in, like, a small malicious optimizer - I don't know, too much white meals in the data.
