AXRP - the AI X-risk Research Podcast cover image

24 - Superalignment with Jan Leike

AXRP - the AI X-risk Research Podcast

00:00

The Importance of Using a Criticism Model in AI Alignment Research

Jeff: The idea is like, we're adding more and more AI knowledge to the evaluation portion of AI alignment research. And by having, by doing it this iterative way, like the ideas that we can like consistently give to a good training signal. Jeff: So for example, our other Jeff is kind of like, you know, the simplest one where you don't use any assistants.

Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner
Get the app