
40 - Jason Gross on Compact Proofs and Interpretability
AXRP - the AI X-risk Research Podcast
00:00
Exploring Modular Arithmetic in Neural Networks
This chapter examines the role of modular arithmetic in infinite-width multi-layer perceptrons (MLPs) and how such networks approximate integrals. It also investigates how symmetry and singular value decomposition (SVD) relate to the manipulation of a network's outputs, and why understanding these concepts matters for mechanistic interpretability.