
41 - Lee Sharkey on Attribution-based Parameter Decomposition
AXRP - the AI X-risk Research Podcast
00:00
Exploring Neural Network Dynamics
This chapter investigates experimental methods in neural networks, focusing on Attribution-based Parameter Decomposition (APD) and its application to toy models. It discusses the intricacies of group operation networks, the impact of hyperparameters, and highlights notable literature examples of attention mechanisms. The conversation reflects on challenges in comprehending computational properties and offers insights into neural activations and information processing in deep learning models.
Transcript
Play full episode