AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
The Horrors of Losing Control of an Aligned System
Do you think it's possible that a lot, whatever that special thing is, let's explore what that special thing could be, that thing appears often in the objective function. Why? You can hope that a particular set of winning lottery numbers come up and it doesn't make the lottery balls come up that way. There's a science problem. We are trying to predict what happens with AI systems that you try to optimize to imitate humans. So if you don't mind my like taking some slight control of things and steering around to what I think is like a good place to start, I just failed to solve the control problem.