The Nonlinear Library: LessWrong

LW - You can remove GPT2's LayerNorm by fine-tuning for an hour by StefanHex

Aug 8, 2024