The Nonlinear Library cover image

LW - SAE reconstruction errors are (empirically) pathological by wesg

The Nonlinear Library

00:00

Challenges in Evaluating SAE Reconstructions

This chapter delves into the limitations of SAE reconstructions, advocating for the assessment through metrics like KL divergence. It explores the significance of maintaining token probabilities in reconstructions and proposes avenues for future research in improving reconstructions in line with epsilon random substitution KL divergence.

Play episode from 13:44
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app