AXRP - the AI X-risk Research Podcast

38.2 - Jesse Hoogland on Singular Learning Theory

Nov 27, 2024
Jesse Hoogland, executive director of Timaeus and researcher in singular learning theory (SLT), discusses how SLT bears on AI alignment. He explains the refined local learning coefficient (LLC) and its role in uncovering new circuits in language models. The conversation also touches on the challenges of interpretability and model complexity. Hoogland emphasizes the importance of outreach in disseminating research and fostering interdisciplinary collaboration on AI safety.