LessWrong (Curated & Popular)

“Self-fulfilling misalignment data might be poisoning our AI models” by TurnTrout

Mar 4, 2025