AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
Scalable Oversight of AI Systems
You know, you wrote a really interesting article on scalable oversight based on experiments that offers some hope that humans may be able to help AI to not go off the rails. So how should we think about oversight of AI systems that may become more capable than we are in many ways so that they align more closely with human goals?