AXRP - the AI X-risk Research Podcast cover image

24 - Superalignment with Jan Leike

AXRP - the AI X-risk Research Podcast

00:00

Scalable Oversight for AI Alignment Research

JADJBT is looking at ways to leverage AI to assist human evaluation on difficult tasks. It's the idea that like, we solve the problems of how to align something roughly human level. And then there's like additional problems as it gets smarter. So I imagine that an actual solution to aligning superintelligence will look quite different from what we do today.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app