2min chapter

AXRP - the AI X-risk Research Podcast cover image

4 - Risks from Learned Optimization with Evan Hubinger

AXRP - the AI X-risk Research Podcast

CHAPTER

Masoptimizer

The paper is sort of talking about these risks from, like, these learned optimisers and optimizes. And i'm wondering, like it is mess optimiser the right category? If you think about the sutabular masoptmiser, how likely would you be to actually fine the sort of tabular masoptimizer? I think the answer is very unlikey. It has to incode all this sort of really complex, like, results of athorization explicitly in a way. This takes up so much space and explicit memorization. But not just memorization, like, what your deployment disfigured, what your sort of, like, out of distribution behaviour would be,

00:00

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode