
"Model Organisms of Misalignment: The Case for a New Pillar of Alignment Research" by evhub, Nicholas Schiefer, Carson Denison, Ethan Perez
LessWrong (Curated & Popular)
Introduction
This chapter emphasizes the importance of researching model organisms of misalignment and the potential existential threats that misalignment poses. It highlights the lack of empirical evidence, including specific examples of misalignment, for concerning sources of existential risk such as deceptive AI systems.