LessWrong (Curated & Popular) cover image

“EIS XIII: Reflections on Anthropic’s SAE Research Circa May 2024” by scasper

LessWrong (Curated & Popular)

00:00

Introduction

The author evaluates Anthropic's latest research on sparse auto encoders, discussing the achievements and limitations of the paper while expressing reservations about the practicality and efficacy of their interpretability research approach.

Play episode from 00:00
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app