
David Lorell
Author of the article being narrated on this episode; presents a proof showing that resampling approximately conserves redundancy and mediation under the Jensen–Shannon divergence. Contributor to LessWrong posts on information-theoretic properties of latent variables.
Best podcasts with David Lorell
Ranked by the Snipd community

Feb 9, 2024 • 12min
[HUMAN VOICE] "A Shutdown Problem Proposal" by johnswentworth, David Lorell
In this podcast, johnswentworth and David Lorell propose a solution to the shutdown problem in AI by using a sub-agent architecture and negotiation between utility-maximizing subagents. They discuss the design of an agent with multiple subagents and the importance of corrugibility. They also explore alignment problems, ontological issues, designing utility functions, and challenges in bridging the theory-practice gap.


