AXRP - the AI X-risk Research Podcast cover image

7 - Side Effects with Victoria Krakovna

AXRP - the AI X-risk Research Podcast

CHAPTER

Impact Measures to Preserve Reachability of States

An impact measure looks at all the different possible states that you could reach, rather than just think about whether can we get back to the starting stage or not. So if there's just something going on in the environment that's going to make some state less rechable, than the agent would have an incentive to stop that. And so this is one a test case that sort of measures whether or mpact measure is actually addressing side effects. But then there are other test cases that look at whether the measure is introducing some, know, new incentives that we might not want. For example, if he're just warding the agent for, like, the rechability odi states,

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner