12 - AI Existential Risk with Paul Christiano

AXRP - the AI X-risk Research Podcast

Are You Doing Well or Poorly?

AI systems are designed to do useful tasks, at least most of them. But they can also have bad motivations and try to get a signal that says you're doing the task well. And so what the system learns is that it should definitely not do that. Take a system focused on making logistics good: "If I mess with the information about how well logistics is going, I'd better not let them ever get back into the data center," for example.
