4min chapter

FUTURATI PODCAST cover image

Ep. 129: Applying the 'security mindset' to AI and x-risk | Jeffrey Ladish

FUTURATI PODCAST

CHAPTER

The Future of Alignment Research

I get the sense that alias or just isn't super on board with any of them, and they have all that they all have a bunch of kind of obvious failure modes. I don't feel like anyone has proposed something that's like, yes, this approach could really work. Work I'm excited about is one of them one of the areas is just interpretability or like mechanistic interpretability also anomaly detection. Paul Christiano is like doing a bunch of stuff at arc. That I think it's pretty interesting. Yeah, we're ready for sure.

00:00

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode