The Importance of Safety Performance Tradeoffs in Existential Safety

I think there will always be some amount of safety performance trade off no matter how good of like technical progress we make on alignment so I don't do it as something that can be solved perfectly at anytime soon. One of the things that we can trade off for is you can just like test your system more before you deploy it and that means that it takes you longer to deploy the system. And another one is keeping a human in the loop so in general a lot of times you know having a human sort of overseeing the systems behavior and saying like oh this doesn't look safe let's shut it down or let's not let it take that action whatever could really harm the performance of the system.

Play episode from 02:19:51

Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app