AI Safety Fundamentals

Interpretability in the Wild: A Circuit for Indirect Object Identification in GPT-2 Small

Jan 4, 2025
Ask episode
Chapters
Transcript
Episode notes