3min chapter

The Inside View cover image

3. Evan Hubinger on Takeoff speeds, Risks from learned optimization & Interpretability

The Inside View

CHAPTER

Transparency Training

We need to train models in such a way that doesn't just look at the model's behavior. We have to use transparence wolls to solve the problem in the first place. And so i'm so in favor of approaches where we we directly train models to sort of using transparent tls. Whereas chris is more like an post mortemyo, you'll see why it isnt. Wi idn't work, ye amso think athing to di a.

00:00

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode