4min chapter

4 - Risks from Learned Optimization with Evan Hubinger

AXRP - the AI X-risk Research Podcast

CHAPTER

Proxy Alignment Failures

In practice, in many situations, we don't train to zero training error. We have really complex datasets that are very difficult to fit completely. In that situation, it's [not] just an identifiability [problem], and in fact, you can end up in a situation where the inductive biases can be stronger, in some sense, than, sort of, the training data. If you train too far on the training data, sort of perfectly fit it, you sort of overfit the training data. But if you stop such that your [inductive biases] still have a strong influence on [...] the actual sort of thing that you end up with, then you're in
