

Nora Belrose
Head of interpretability at EleutherAI, focusing on understanding and improving AI's inner workings and alignment.
Best podcasts with Nora Belrose
Ranked by the Snipd community

9 snips
Nov 17, 2024 • 2h 30min
Nora Belrose - AI Development, Safety, and Meaning
Nora Belrose, Head of Interpretability Research at EleutherAI, dives into the complexities of AI development and safety. She explores concept erasure in neural networks and its role in bias mitigation. Challenging doomsday fears about advanced AI, she critiques current alignment methods and highlights the limitations of traditional approaches. The discussion broadens to consider the philosophical implications of AI's evolution, including a fascinating link between Buddhism and the search for meaning in a future shaped by automation.

Feb 19, 2025 • 60min
Interpreting AI’s Acceleration (Robert Wright & Nora Belrose)
Nora Belrose, Head of Interpretability at EleutherAI, specializes in making AI more understandable and aligned with human values. She discusses whether a technological singularity is imminent and shares her concerns about AI potentially taking over jobs in just two years. The conversation dives into the evolution of AI reasoning models, contrasting them with human thought processes. Nora also emphasizes the importance of transparency in AI development and explores the societal impacts of open-source AI.