
Exploring the Biology of LLMs with Circuit Tracing with Emmanuel Ameisen - #727
The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)
00:00
Interventions in Biological Studies and Language Models
This chapter explores the concept of interventions in biological studies with a focus on language models, highlighting the differences between nudges and direct manipulations. It examines how these interventions affect model behavior, detailing the complexities of feature representation and interactions within multilayer perceptrons. Through real-world examples, the discussion reveals the intricacies of model decision-making processes and the organization of mathematical operations like addition within computational architectures.
Transcript
Play full episode