7 - Side Effects with Victoria Krakovna

AXRP - the AI X-risk Research Podcast

CHAPTER

Is It Possible to Integrate Human Preferences in the Impact Measure?

Human preferences could come in as weights on different possible auxiliary goals. So far, I think the most complex environment that the impact measures were tested on was SafeLife, with some encouraging results. But I think it would be interesting to see it on environments that are not gridworlds, like DeepMind Lab or one of those things where, like, you know, the agent walks around a room with walls or whatever. Any different objects? So, yes, I think that would be interesting and probably illuminating, but also, like, not very easy to implement.
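
To make the idea of preference weights on auxiliary goals a little more concrete, here is a minimal sketch. It assumes an impact penalty computed as a weighted average of how much each auxiliary goal's attainable value changes relative to a baseline such as doing nothing. The function name, the goal names, and all numbers are hypothetical illustrations, not something described in the episode.

```python
from typing import Dict


def weighted_impact_penalty(
    values_after_action: Dict[str, float],
    values_after_baseline: Dict[str, float],
    preference_weights: Dict[str, float],
) -> float:
    """Weighted average absolute change in auxiliary-goal values vs. a baseline.

    Hypothetical helper: each auxiliary goal has an attainable value after the
    agent's action and after a baseline (e.g. a no-op). Human preferences enter
    only through the per-goal weights.
    """
    total_weight = sum(preference_weights.values())
    if total_weight <= 0:
        raise ValueError("preference weights must sum to a positive number")

    penalty = 0.0
    for goal, weight in preference_weights.items():
        change = abs(values_after_action[goal] - values_after_baseline[goal])
        penalty += weight * change
    return penalty / total_weight


# Made-up values: the action breaks a vase and half-opens a door.
after_noop = {"vase_intact": 1.0, "door_open": 0.0}
after_action = {"vase_intact": 0.0, "door_open": 0.5}

# Uniform weights treat both changes as equally important.
uniform = weighted_impact_penalty(
    after_action, after_noop, {"vase_intact": 1.0, "door_open": 1.0}
)

# A human who cares much more about the vase gives that goal a larger weight.
vase_heavy = weighted_impact_penalty(
    after_action, after_noop, {"vase_intact": 9.0, "door_open": 1.0}
)

print(uniform, vase_heavy)
```

With these made-up numbers, putting more weight on the vase goal raises the penalty for the same action, which is the sense in which preferences could shape the impact measure under this assumed form.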
