
"Model Organisms of Misalignment: The Case for a New Pillar of Alignment Research" by evhub, Nicholas Schiefer, Carson Denison, Ethan Perez
LessWrong (Curated & Popular)
00:00
Tool Use and Model Behavior
Exploring the model's use of different tools and its reasoning process before producing an output, as well as different training methods for inference.
Transcript
Play full episode