The Importance of Using a Criticism Model in AI Alignment Research

Jeff: The idea is like, we're adding more and more AI knowledge to the evaluation portion of AI alignment research. And by having, by doing it this iterative way, like the ideas that we can like consistently give to a good training signal. Jeff: So for example, our other Jeff is kind of like, you know, the simplest one where you don't use any assistants.

Play episode from 32:36

Transcript

Episode notes

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app