The Nonlinear Library

AF - You can remove GPT2's LayerNorm by fine-tuning for an hour by Stefan Heimersheim

Aug 8, 2024
Ask episode
Chapters
Transcript
Episode notes