Is There a Negative Outcome for AI?

There is experimental evidence in every kind of statistical setup that we've tried the default outcome seems to be competition. The idea here is like well you know if we want a world that at least like at a bare minimum this this AI system is not doing things that we don't like we have to do more than zero effort to get a non negative outcome for ourselves. We maybe don't have to be super aligned like crazy aligned but we at least have to agree on like 20% of things on average something I don't know what the actual number is.

Play episode from 47:25

Transcript

Episode notes

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app

130. Edouard Harris - New Research: Advanced AI may tend to seek power *by default*

Towards Data Science

Is There a Negative Outcome for AI?

The AI-powered Podcast Player

130. Edouard Harris - New Research: Advanced AI may tend to seek power by default