AXRP - the AI X-risk Research Podcast cover image

7 - Side Effects with Victoria Krakovna

AXRP - the AI X-risk Research Podcast

CHAPTER

Can You Give Us a Summary of Which Formulations Are Actually Past the Most Test Cases?

One of the test cases is like testing for this sike, undesirable offsetting incentive. But as i sort of describe in the future, task paper have up dated towards te offsetting not necessarily being bad. And so there's kind of more new ants about this now, where you can have, you know, good upsetting and bad ufsetting.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner