4min chapter

CSPI Podcast cover image

AI Alignment as a Solvable Problem | Leopold Aschenbrenner & Richard Hanania

CSPI Podcast

CHAPTER

The Basic Physics of Interpretability

The interpretability work I've described so far is a bit more kind of like the sort of like top down interpretability. Most of the time when people talk about interpretability, they mean mechanistic interpretability. So that's basically we're going to like sort of like think of this as sort of like the basic physics version of interpretability. There's a lot on topic is then good, you know, sort of the pioneer of this is then awesome work. Neil Nann, there's a person who's sort of maybe you've seen sort of online and is active and has done some really interesting work on this. For example, sometimes it seems like neural networks suddenly understand the thing

00:00

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode