AXRP - the AI X-risk Research Podcast cover image

12 - AI Existential Risk with Paul Christiano

AXRP - the AI X-risk Research Podcast

CHAPTER

How Much Do You Rely on Hindsight to Evaluate Behaviour?

Aur ai systems can do tasks that are very hard for humans to value is answers. The more you want or rely on foresight, the more plausible it is that the human doesn't understand well enough to do the operation. It depends alot how long we make the behaviors and how much hindsight we give the human evaluators. And in general, that's like part of the tension or game: lie really good at one thing but not so good at another.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner