AI Safety Fundamentals

Understanding Intermediate Layers Using Linear Classifier Probes

Jan 4, 2025
Episode notes