The Importance of Safety-Performance Tradeoffs in Existential Safety
I think there will always be some amount of safety-performance tradeoff, no matter how much technical progress we make on alignment, so I don't view it as something that can be solved perfectly anytime soon. One of the things you can trade off is testing: you can just test your system more before you deploy it, and that means it takes you longer to deploy the system. Another one is keeping a human in the loop. In general, a lot of the time, having a human overseeing the system's behavior and saying, "Oh, this doesn't look safe, let's shut it down," or "Let's not let it take that action," could really harm the performance of the system.