AXRP - the AI X-risk Research Podcast cover image

24 - Superalignment with Jan Leike

AXRP - the AI X-risk Research Podcast

00:00

The Importance of Using a Criticism Model in AI Alignment Research

Jeff: The idea is like, we're adding more and more AI knowledge to the evaluation portion of AI alignment research. And by having, by doing it this iterative way, like the ideas that we can like consistently give to a good training signal. Jeff: So for example, our other Jeff is kind of like, you know, the simplest one where you don't use any assistants.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app