AXRP - the AI X-risk Research Podcast cover image

23 - Mechanistic Anomaly Detection with Mark Xu

AXRP - the AI X-risk Research Podcast

CHAPTER

The Importance of Predicting the Future

I think I'm imagining neither it's possible that we should just stop talking about this particular hope because I also don't think it's particularly plausible but if you imagine that like your policy is using some prediction of the future to select action A so as long as your evaluation of the action is like as good as the evaluation your policy is used in some sense then I think they don't have to be like literally the same. So our goal is to be able to talk about this sort of like physical actual diamond in the world outside of how the action achieved the diamond still being there if that makes sense. We do want to do mechanistic knowledge detection on the part where like the diamond appears on

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner