AXRP - the AI X-risk Research Podcast cover image

23 - Mechanistic Anomaly Detection with Mark Xu

AXRP - the AI X-risk Research Podcast

CHAPTER

The Importance of Mechanistic Anomaly Detection

I'm not sure what notion of mechanism of reasoning mechanism would distinguish between those two yeah so I want to talk about the second thing I was going to say which great hopefully addresses this issue. So our hope is that we're not going to do mechanistic anomaly detection with respect to like your AI coming up with plans to protect well maybe we are but here's a simple setting in which there's a specific kind of mechanisticomaly detection that I think might be sufficient. We have some cameras and then some process where we like look at the cameras and we decide whether or not the diamond is still theremaybe we like look for specific patterns of pixels and like various diamond looking shapes etc etc if that

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner