"Model Organisms of Misalignment: The Case for a New Pillar of Alignment Research" by evhub, Nicholas Schiefer, Carson Denison, Ethan Perez

LessWrong (Curated & Popular)

Introduction

This chapter makes the case for researching model organisms of misalignment. It highlights how little empirical evidence currently exists for the most concerning sources of existential risk, such as deceptive AI systems, and how few specific examples of misalignment are available to study.
