2min chapter


#107 – Chris Olah on what the hell is going on inside neural networks

80,000 Hours Podcast

CHAPTER

How to Identify the Neurons in a Model?

Researchers often look at weights in the first layer. And so if you want to study weights anywhere other than the absolute input or maybe the absolute output, you need to go and have some technique for understanding what the neurons that are going in and out of those weights are. One thing we found is actually very effective is to optimize the input. We call this feature visualization. You just do gradient descent to go and create an image, say, that causes the neuron to fire really strongly. It separates correlation from causation. In that resulting image, you know that everything that's there is there because it caused the neuron to fire.
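To make the "optimize the input" idea concrete, here is a minimal sketch in plain Python. It is not the method used in the episode's research: the two-layer tanh "model", its weights (`W1`, `w2`), the unit-norm constraint on the input, and the function names are all assumptions chosen so the example stays small and runnable. The speaker says "gradient descent"; the sketch takes gradient ascent steps on the activation, which is the same as descent on its negative.

```python
import math

# Hypothetical toy model: 3 inputs -> 2 tanh hidden units -> 1 "neuron".
# These weights are made up for illustration only.
W1 = [[1.0, 0.0, -1.0],
      [0.5, 0.5, 0.0]]
w2 = [1.0, -0.5]

def neuron_activation(x):
    """Return the neuron's activation and the hidden-unit values for input x."""
    hidden = [math.tanh(sum(w * xi for w, xi in zip(row, x))) for row in W1]
    return sum(c * h for c, h in zip(w2, hidden)), hidden

def input_gradient(x):
    """Analytic gradient of the activation with respect to the input x."""
    _, hidden = neuron_activation(x)
    grad = [0.0] * len(x)
    for row, c, h in zip(W1, w2, hidden):
        scale = c * (1.0 - h * h)  # d tanh(z)/dz = 1 - tanh(z)^2
        for i, w in enumerate(row):
            grad[i] += scale * w
    return grad

def visualize_feature(n_inputs, steps=200, lr=0.1):
    """Optimize the input (not the weights) so the neuron fires strongly."""
    x = [0.0] * n_inputs  # start from a blank "image"
    for _ in range(steps):
        grad = input_gradient(x)
        x = [xi + lr * g for xi, g in zip(x, grad)]  # ascent step on the input
        norm = math.sqrt(sum(xi * xi for xi in x))
        if norm > 1.0:  # assumed constraint: keep the input inside the unit ball
            x = [xi / norm for xi in x]
    return x

start_act, _ = neuron_activation([0.0, 0.0, 0.0])
image = visualize_feature(3)
final_act, _ = neuron_activation(image)
print("activation before:", start_act)
print("activation after: ", final_act)
print("optimized input:  ", [round(v, 3) for v in image])
```

Every component of the optimized input is there only because the gradient of the neuron's activation pushed it there, which is the sense in which this approach separates causation from correlation. Real feature visualization applies the same loop to an image fed through a trained network, typically using a framework's automatic differentiation rather than a hand-written gradient.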
