LessWrong (Curated & Popular)

LessWrong

Audio narrations of LessWrong posts. Includes all curated posts and all posts with 125+ karma. If you'd like more, subscribe to the “LessWrong (30+ karma)” feed.

Episodes

Oct 15, 2023 • 10min

[HUMAN VOICE] "Inside Views, Impostor Syndrome, and the Great LARP" by John Wentworth

Yoshua Bengio, a Turing Award winner for deep learning research, discusses the importance of deep models and understanding in ML. Topics include Unitary Evolution Recurrent Neural Networks and gradient explosion/death in recurrent nets. The episode also explores impostor syndrome and how fields make progress through feedback loops and by admitting a lack of knowledge.

Oct 15, 2023 • 9min

"Towards Monosemanticity: Decomposing Language Models With Dictionary Learning" by Zac Hatfield-Dodds

The podcast discusses the challenges of understanding the behavior of neural networks and some proposed solutions. It explores analyzing features instead of individual neurons and decomposing language models into interpretable parts, including how activating specific features influences a model's output. It also covers the universality of learned features and Anthropic's investment in mechanistic interpretability.

Oct 9, 2023 • 17min

"Response to Quintin Pope’s Evolution Provides No Evidence For the Sharp Left Turn" by Zvi

Guest Quintin Pope argues that the sudden growth in human capabilities was due to cultural transmission, not evolution, and this does not apply to AI. The podcast explores the parallels between human evolution and AI development, the challenges of alignment techniques, and the importance of discernment in autonomous learning.

Oct 6, 2023 • 52min

"Thomas Kwa's MIRI research experience" by Thomas Kwa and others

Thomas Kwa shares his research experience working with MIRI. They discuss concrete problems, information hazard prevention, agent reflection, deconfusion, empirical feedback, and understanding cognition.

Oct 3, 2023 • 6min

"The Lighthaven Campus is open for bookings" by Habryka

Explore the newly renovated Lighthaven campus, now available for bookings. Learn about the range of services, buildings, and goals of Lighthaven. Discover the wide variety of events that can be hosted, with preferential pricing for projects beneficial to the world.

Episodes

[HUMAN VOICE] "Inside Views, Impostor Syndrome, and the Great LARP" by John Wentworth

"Comparing Anthropic's Dictionary Learning to Ours" by Robert_AIZI

"Announcing MIRI’s new CEO and leadership team" by Gretta Duleba

"Cohabitive Games so Far" by mako yass

"Announcing Dialogues" by Ben Pace

"Evaluating the historical value misspecification argument" by Matthew Barnett

"Towards Monosemanticity: Decomposing Language Models With Dictionary Learning" by Zac Hatfield-Dodds

"Response to Quintin Pope’s Evolution Provides No Evidence For the Sharp Left Turn" by Zvi

"Thomas Kwa's MIRI research experience" by Thomas Kwa and others

"The Lighthaven Campus is open for bookings" by Habryka