
23 - Mechanistic Anomaly Detection with Mark Xu
AXRP - the AI X-risk Research Podcast
The Mechanisms of Manipulation of Noise
The thing I think I'm not getting is like this idea of like manipulating the noise, which seems like a model dependent thing. So just because your AI wanted to make there be a diamond does not imply that the like particular action it took will in fact make it be there's a diamond. And so you still have to talk about the particular mechanism of action for the like particularaction your AI decided.
00:00
Transcript
Play full episode
Remember Everything You Learn from Podcasts
Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.