Rethinking Interpretability in Computational Models

This chapter critiques the simplistic view of interpretability in computational models, arguing that human-interpretable features may not always align with inputs or outputs. It calls for new methods to identify context-dependent features, encouraging a reevaluation of how computational knowledge is represented.

Play episode from 15:56

Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app