
Alex Mallen
Author of the LessWrong post presenting the behavioural selection model; provides the main exposition on how behavioural selection shapes AI motivations and related implications.
Best podcasts with Alex Mallen
Ranked by the Snipd community

Dec 11, 2025 • 36min
“The behavioral selection model for predicting AI motivations” by Alex Mallen, Buck
In this discussion, Alex Mallen, an insightful author known for his work on AI motivations, delves into the behavioral selection model. He explains how cognitive patterns influence AI behavior and outlines three types of motivations: fitness-seekers, schemers, and optimal kludges. Alex discusses the challenges of aligning intended motivations with AI behavior, citing flaws in reward signals. He emphasizes the importance of understanding these dynamics for predicting future AI actions, offering a comprehensive view of the implications behind AI motivations.

Dec 4, 2025 • 36min
“The behavioral selection model for predicting AI motivations” by Alex Mallen, Buck
In this enlightening discussion, Alex Mallen, a researcher on AI alignment and safety, introduces the behavioral selection model for predicting AI motivations. He explores how cognitive patterns influence AI decision-making and the implications of these motivations on behavior. Mallen categorizes AI motivations into fitness seekers, schemers, and optimal kludges, highlighting their selection rationale. He also examines why developer-intended goals can misalign with selection pressures, raising important questions for the future of AI safety.


