The Nonlinear Library

AF - Inducing Unprompted Misalignment in LLMs by Sam Svenningsen

Apr 19, 2024
Ask episode
Chapters
Transcript
Episode notes