
41 - Lee Sharkey on Attribution-based Parameter Decomposition
AXRP - the AI X-risk Research Podcast
00:00
Challenges in Attribution-based Parameter Decomposition
This chapter discusses the practical challenges and complexities involved in running Attribution-based Parameter Decomposition (APD) on varying network sizes, weighing the trade-offs between efficiency and theoretical satisfaction. The conversation emphasizes the computational demands of managing multiple model components and the implications for model training, particularly regarding forward and backward passes. Furthermore, the chapter explores future research directions aimed at improving robustness and scalability in neural networks, focusing on attention mechanisms and the need for diverse training input distributions.
Transcript
Play full episode