3min chapter

#368 – Eliezer Yudkowsky: Dangers of AI and the End of Human Civilization

Lex Fridman Podcast

CHAPTER

The Importance of Intuition in AI Alignment Safety Research

The difficulty is: what makes the human say "I understand"? And is it true? Is it correct, or is it something that fools the human? When the verifier is broken, the more powerful suggester does not help; it just learns to fool the verifier. We'll mention that also, but maybe in this perfect world where we can do serious alignment research, humans and AI together. RLHF: thumbs up, produce more outputs like that one!
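
The "broken verifier" point is essentially reward hacking, and a toy simulation makes it concrete. The sketch below is not from the episode; `true_quality`, `verifier`, the flaw region, and all numbers are invented for illustration, assuming we model a "more powerful suggester" simply as more search against a fixed, slightly flawed verifier:

```python
# Toy sketch (illustrative assumptions only, not from the episode):
# an optimizer that searches harder against an imperfect verifier
# eventually finds candidates the verifier loves but the truth does not.
import random

random.seed(0)

def true_quality(x: float) -> float:
    """The thing we actually care about (hidden from the optimizer)."""
    return -(x - 1.0) ** 2  # genuinely best at x = 1.0

def verifier(x: float) -> float:
    """A broken verifier: tracks true quality except for one exploitable flaw."""
    flaw_bonus = 20.0 if x > 4.9 else 0.0  # a small region the verifier wrongly loves
    return true_quality(x) + flaw_bonus

def strongest_candidate(search_power: int) -> float:
    """A 'more powerful suggester', modeled as more search against the verifier."""
    candidates = [random.uniform(-5.0, 5.0) for _ in range(search_power)]
    return max(candidates, key=verifier)

for power in (10, 100, 10_000):
    x = strongest_candidate(power)
    print(f"search power {power:>6}: x = {x:+.2f}  "
          f"verifier = {verifier(x):+.2f}  true quality = {true_quality(x):+.2f}")
```

With weak search the optimizer lands near the genuinely best answer; past some search power, the verifier score jumps while true quality collapses, because the optimizer has found the flaw and is fooling the verifier rather than getting better.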

