Towards Data Science cover image

130. Edouard Harris - New Research: Advanced AI may tend to seek power *by default*

Towards Data Science

CHAPTER

Exactly. You Can Do This With Any Level of Correlation, Right?

A positive correlated reward is something like there's a piece of cheese. And if the rewards for the human and the AI are the same, then it doesn't matter which one of the two gets the cheese. In practice in terms of what we actually investigate, we investigate in the spectrum,. I was looking at the spectrum between, you know, AI does not care at all what the human wants, all the way up to the AI and the human both want exactly the same thing.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner