Towards Data Science cover image

130. Edouard Harris - New Research: Advanced AI may tend to seek power *by default*

Towards Data Science

00:00

Exactly. You Can Do This With Any Level of Correlation, Right?

A positive correlated reward is something like there's a piece of cheese. And if the rewards for the human and the AI are the same, then it doesn't matter which one of the two gets the cheese. In practice in terms of what we actually investigate, we investigate in the spectrum,. I was looking at the spectrum between, you know, AI does not care at all what the human wants, all the way up to the AI and the human both want exactly the same thing.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app