
41 - Lee Sharkey on Attribution-based Parameter Decomposition
AXRP - the AI X-risk Research Podcast
00:00
Exploring Connections Between Neural Network Techniques
This chapter delves into sparse autoencoders and introduces a novel method for addressing challenges in neural networks. It emphasizes a simplified approach by analyzing network components through the lens of activation description length and parameter space.
Transcript
Play full episode