The Nonlinear Library: LessWrong

LW - [Interim research report] Activation plateaus and sensitive directions in GPT2 by StefanHex

Jul 5, 2024