LessWrong (Curated & Popular)

LessWrong
Aug 7, 2024 • 16min

“This is already your second chance” by Malmesbury

A colossal ivory cube descends, carrying instructions to save humanity from an AI apocalypse. In a humorous twist, Kublai Khan engages in witty banter with an AI while wrestling with the ethical dilemmas of superintelligent technology. The tale assigns absurd tasks to be completed in 2024, blending satire with philosophical musing. Through imaginative storytelling, it highlights the difficulty of navigating present-day technological threats and reflects on how humans behave in the face of impending doom.
Aug 7, 2024 • 20min

“0. CAST: Corrigibility as Singular Target” by Max Harms

Dive into the concept of corrigibility in AI, where the discussion moves from confusion to clarity. Discover how this single property can be crucial for creating agents that are both effective and safe, and learn about strategies for measuring and strengthening it during AI development. The episode critiques the usual approach of training agents on a mixture of goals and proposes corrigibility as the sole target instead. Prepare for a journey through the nuances of AI behavior and safety that could reshape future development.
Aug 7, 2024 • 23min

“Self-Other Overlap: A Neglected Approach to AI Alignment” by Marc Carauleanu, Mike Vaiana, Judd Rosenblatt, Diogo de Lucena

Join guests Bogdan Ionut Cirstea, Steve Byrnes, Gunnar Zarncke, Jack Foxabbott, and Seong Hah Cho, who contribute critical insights on AI alignment. The episode discusses self-other overlap, a technique that optimizes AI models to represent others the way they represent themselves. Early experiments suggest the technique can reduce deceptive behavior in AI. Because it is scalable and requires little interpretability, self-other overlap could be a promising route to pro-social AI.
Aug 7, 2024 • 9min

“You don’t know how bad most things are nor precisely how they’re bad.” by Solenoid_Entity

Dive into the world of discernment, where time and attention sharpen our perception of quality. Explore the nuances of piano tuning, where even experts struggle to detect subtle flaws, and consider how often we overlook our own blind spots. The discussion highlights the perils of relying on automation for tasks that require skilled judgment, emphasizing the intricate details of reality that usually go unnoticed.
Aug 7, 2024 • 22min

“Recommendation: reports on the search for missing hiker Bill Ewasko” by eukaryote

Tom Mahood, an insightful blogger on missing persons, teams up with Adam Marsland, a dedicated videographer, to discuss the enigmatic 2010 disappearance of hiker Bill Ewasko in Joshua Tree National Park. They explore the complexities of wilderness searches, the essential strategies employed when looking for missing individuals, and the emotional toll faced by searchers. The conversation reveals the challenges in navigating both the terrain and the psychological aspects of such tragic cases, shedding light on the critical lessons learned from this heartbreaking incident.
Aug 7, 2024 • 30min

“The ‘strong’ feature hypothesis could be wrong” by lsgos

lsgos, a member of the Google DeepMind language model interpretability team, dives deep into the complexities of AI interpretability. The post challenges the "strong" feature hypothesis, arguing that a network's internals may not decompose into discrete, human-interpretable features as neatly as commonly assumed. It explores the distinction between explicit and tacit representations, using chess as a metaphor for decision-making, and calls for a reevaluation of how we interpret neural networks, advocating methods that account for context-dependent features.
Jul 30, 2024 • 4min

“‘AI achieves silver-medal standard solving International Mathematical Olympiad problems’” by gjm

Explore groundbreaking advances in AI with Google DeepMind's latest systems, AlphaProof and AlphaGeometry. These systems tackled problems from the International Mathematical Olympiad and reached the silver-medal standard. Discover how AlphaProof combines LLMs with formal proof-checking to refine candidate solutions, while AlphaGeometry specializes in geometry problems. Notably, some of the reinforcement learning took place during the contest itself, offering a fascinating glimpse of the future of problem-solving AI.
Jul 29, 2024 • 24min

“Decomposing Agency — capabilities without desires” by owencb, Raymond D

In this insightful discussion, the authors dive into the slippery concept of what constitutes an agent. They explore how breaking agency into components like goals and planning can reshape our views, especially of advanced AI systems. The conversation touches on possible technological futures, emphasizing the role of collective decisions and scenario mapping in guiding the development of AGI, and challenges us to rethink our assumptions about intelligence, both human and artificial.
Jul 27, 2024 • 16min

“Universal Basic Income and Poverty” by Eliezer Yudkowsky

Eliezer Yudkowsky, a renowned thinker on artificial intelligence and rationality, discusses the complexities of Universal Basic Income (UBI) and its limits as a remedy for poverty. He highlights how advances in material wealth don't necessarily eliminate deprivation, using historical comparisons to illustrate the paradox. Yudkowsky also invokes the hypothetical country of Anoxistan, where everything is abundant except oxygen, to show how poverty can hinge on a single scarce necessity rather than overall material possession. Ultimately, he calls for deeper economic research into the true nature of poverty.
Jul 19, 2024 • 13min

“Optimistic Assumptions, Longterm Planning, and ‘Cope’” by Raemon

Raemon discusses the pitfalls of optimistic assumptions in AI planning, highlighting the dangers of 'cope-y' reasoning. He walks through examples of questionable plans and the necessity of realistic long-term planning, delving into the challenge of keeping assumptions aligned with likely outcomes and emphasizing the importance of critical thinking in AI safety.
