AXRP - the AI X-risk Research Podcast cover image

7 - Side Effects with Victoria Krakovna

AXRP - the AI X-risk Research Podcast

00:00

Impact Measures to Preserve Reachability of States

An impact measure looks at all the different possible states that you could reach, rather than just think about whether can we get back to the starting stage or not. So if there's just something going on in the environment that's going to make some state less rechable, than the agent would have an incentive to stop that. And so this is one a test case that sort of measures whether or mpact measure is actually addressing side effects. But then there are other test cases that look at whether the measure is introducing some, know, new incentives that we might not want. For example, if he're just warding the agent for, like, the rechability odi states,

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app