3min chapter

AXRP - the AI X-risk Research Podcast cover image

2 - Learning Human Biases with Rohin Shah

AXRP - the AI X-risk Research Podcast

CHAPTER

How to Model Human Bias in a Logical Environment

i think in this case it was actually that the transition dynamics were the same across all the environments, and the value crition metork was allowed to learn a warped version of them. This is more like, when we came up with our model of what the human human planer was doing, we put into it this, like, incorrect model of how the world works. So that is still a difference, but it isn'tlike we learned a planer that gets the correct transition dynamics and then works them a like that. Ah, or possibly i just said a wrong thing earlier. I mean, imean takin about, you've got af of some something ther,. ye?

00:00

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode