2 - Learning Human Biases with Rohin Shah

AXRP - the AI X-risk Research Podcast

CHAPTER

Is There a Reward for Inferring What You Want?

The technique is trying to infer what the reward is. It's more like assuming a magic camera that gets to watch the person as they go about their day-to-day life. So you just sort of watch the human go around with their life, and you're like, OK, based on the fact that they, you know, had cake today, I can infer that they would like cake or something. Am I wrong about this? No, that seems right. I think you're contemplating the, like, evaluation that we did with the technique.
