
130. Edouard Harris - New Research: Advanced AI may tend to seek power *by default*

Towards Data Science


Is There a Negative Outcome for AI?

There is experimental evidence: in every kind of statistical setup that we've tried, the default outcome seems to be competition. The idea here is, well, if we want a world where, at a bare minimum, this AI system is not doing things that we don't like, we have to put in more than zero effort to get a non-negative outcome for ourselves. We maybe don't have to be super aligned, like crazy aligned, but we at least have to agree on, like, 20% of things on average, or something. I don't know what the actual number is.
