Scalable Oversight for AI Alignment Research

JADJBT is looking at ways to leverage AI to assist human evaluation on difficult tasks. It's the idea that like, we solve the problems of how to align something roughly human level. And then there's like additional problems as it gets smarter. So I imagine that an actual solution to aligning superintelligence will look quite different from what we do today.

Play episode from 18:44

Transcript

Episode notes

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app