
"An artificially structured argument for expecting AGI ruin" by Rob Bensinger
LessWrong (Curated & Popular)
00:00
How to Avert Instrumental Pressures
One method for averting instrumental pressures would be to train an AGI to halt its thought processes whenever it starts to approach dangerous topics. This sort of approach is likely to either fail catastrophically or cripple the system because of issues like unforeseen maxima and nearest unblocked neighbors. We can try to build AGI systems to actively want to stay mild, but this requires us to solve an unusually difficult form of the value loading problem. Unusually difficult because mildness actively runs counter to effectiveness and efficiency. And quoting AGI ruin illicitly thalities - you can't bring the coffee if you're dead, for almost every kind of coffee.
Play episode from 31:01
Transcript


