The Relationship Between Anomalies and Interpretability

A team of researchers trained a model to beat the best models that play Go. They were able to do it by looking at what their adversary was doing. The work suggests even superhuman systems might have silly vulnerabilities, says Andrew Keen. "High frequency, non robust, non interpretable features are kind of the enemy of interpretability"

Play episode from 16:09

Transcript

Episode notes

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app