4min chapter

AXRP - the AI X-risk Research Podcast cover image

12 - AI Existential Risk with Paul Christiano

AXRP - the AI X-risk Research Podcast

CHAPTER

How Much Do You Rely on Hindsight to Evaluate Behaviour?

Aur ai systems can do tasks that are very hard for humans to value is answers. The more you want or rely on foresight, the more plausible it is that the human doesn't understand well enough to do the operation. It depends alot how long we make the behaviors and how much hindsight we give the human evaluators. And in general, that's like part of the tension or game: lie really good at one thing but not so good at another.

00:00

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode