The Discriminator Criticism App

In the critiques paper that we published last year you basically do randomized controlled trials with targeted perturbations. By training it essentially to be a discriminator between the good version and the flawed version. And then you like check that with like if you ask the model or like the arrow Jeff wasn't out the model to write a critique of the code how often does it actually writing about the floor? Now you get like this critique accuracy equivalent. And that's what we call the discriminator critique app.

Play episode from 34:53

Transcript

Episode notes

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app