AXRP - the AI X-risk Research Podcast cover image

23 - Mechanistic Anomaly Detection with Mark Xu

AXRP - the AI X-risk Research Podcast

00:00

The Importance of Mechanistic Anomaly Detection

I'm not sure what notion of mechanism of reasoning mechanism would distinguish between those two yeah so I want to talk about the second thing I was going to say which great hopefully addresses this issue. So our hope is that we're not going to do mechanistic anomaly detection with respect to like your AI coming up with plans to protect well maybe we are but here's a simple setting in which there's a specific kind of mechanisticomaly detection that I think might be sufficient. We have some cameras and then some process where we like look at the cameras and we decide whether or not the diamond is still theremaybe we like look for specific patterns of pixels and like various diamond looking shapes etc etc if that

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app