The Nonlinear Library cover image

LW - SAE reconstruction errors are (empirically) pathological by wesg

The Nonlinear Library

00:00

Introduction

This chapter delves into empirical findings revealing that errors in Sparse Auto Encoder (SAE) reconstruction are significant and consistent through all model layers, leading to a notable increase in KL and loss metrics. Understanding these errors is crucial for advancing SAEs and improving methodologies.

Play episode from 00:00
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app