LessWrong (Curated & Popular) cover image

"An artificially structured argument for expecting AGI ruin" by Rob Bensinger

LessWrong (Curated & Popular)

00:00

How to Avert Instrumental Pressures

One method for averting instrumental pressures would be to train an AGI to halt its thought processes whenever it starts to approach dangerous topics. This sort of approach is likely to either fail catastrophically or cripple the system because of issues like unforeseen maxima and nearest unblocked neighbors. We can try to build AGI systems to actively want to stay mild, but this requires us to solve an unusually difficult form of the value loading problem. Unusually difficult because mildness actively runs counter to effectiveness and efficiency. And quoting AGI ruin illicitly thalities - you can't bring the coffee if you're dead, for almost every kind of coffee.

Play episode from 31:01
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app