Scalable Oversight of AI Systems

You know, you wrote a really interesting article on scalable oversight, based on experiments, that offers some hope that humans may be able to help keep AI from going off the rails. So how should we think about overseeing AI systems that may become more capable than we are in many ways, so that they align more closely with human goals?

