undefined

Jan Kulveit

Author of the LessWrong post "The Pando Problem: Rethinking AI Individuality"

Top 5 podcasts with Jan Kulveit

Ranked by the Snipd community
undefined
6 snips
Mar 29, 2025 • 6min

“Conceptual Rounding Errors” by Jan_Kulveit

Join Jan Kulveit, author and thinker focused on cognitive biases, as he delves into 'Conceptual Rounding Errors.' He discusses how our minds can overly compress new ideas, leading us to miss nuanced differences from existing concepts. Jan reveals how this mechanism can hinder our understanding, especially in complex fields like AI alignment. He shares practical strategies for enhancing cognitive clarity and metacognitive awareness, ensuring we differentiate novelty from familiarity effectively.
undefined
May 30, 2024 • 2h 22min

32 - Understanding Agency with Jan Kulveit

Jan Kulveit, who leads the Alignment of Complex Systems research group, dives into the fascinating intersection of AI and human cognition. He discusses active inference, the differences between large language models and the human brain, and how feedback loops influence behavior. The conversation explores hierarchical agency, the complexities of aligning AI with human values, and the philosophical implications of self-awareness in AI. Kulveit also critiques existing frameworks for understanding agency, shedding light on the dynamics of collective behaviors.
undefined
Apr 3, 2025 • 28min

“The Pando Problem: Rethinking AI Individuality” by Jan_Kulveit

In this engaging discussion, guest Jan Kulveit, an author and AI researcher, explores the concept of individuality in artificial intelligence, using the Pando aspen grove as a metaphor. He examines the risks of attributing human-like qualities to AI, urging a reevaluation of how we understand AI behaviors. He also discusses collective agency in AI systems, including the implications for coordination and ethical alignment. Kulveit emphasizes the need for robust models that account for the complexities of AI identity and autonomy in dialogue with humans.
undefined
Feb 5, 2025 • 11min

“Gradual Disempowerment, Shell Games and Flinches” by Jan_Kulveit

In this engaging discussion, Jan Kulveit, author and insightful thinker on AI risks, delves into the concept of Gradual Disempowerment. He examines how as human cognition loses its value, societal systems may become misaligned with human interests. Kulveit highlights intriguing patterns of avoidance in conversations about AI, encapsulated by ideas like 'shell games' and 'flinches.' He also warns against the dangers of delegating too much to future AI, encouraging a more proactive engagement with the complex challenges ahead.
undefined
Jan 26, 2025 • 18min

“A Three-Layer Model of LLM Psychology” by Jan_Kulveit

Jan Kulveit, author and AI enthusiast, delves into the fascinating psychology of character-trained LLMs like Claude. He presents a three-layer model: the Surface Layer, Character Layer, and Predictive Ground Layer, illustrating how they interact and shape AI behaviors. Kulveit discusses the implications of anthropomorphizing LLMs, emphasizing a nuanced understanding of their authenticity. He also tackles the limitations and open questions that arise when interpreting AI interactions, providing insights that could redefine our approach to engaging with language models.