LessWrong (Curated & Popular)

The Plan - 2023 Version

Jan 4, 2024
The episode lays out a plan for AI alignment, focusing on interpretability and on finding alignment targets, and highlights the importance of robust bottlenecks. It explores the role of abstraction in AI systems and the challenges of choosing ontologies, then turns to Goodhart problems, approximation, and optimizing for true names. It also covers designing for zero information leak and the role of chaos, the challenges of abstraction and of reward-based approaches in AI training, and the iterative process in engineering and in software and AI development.