Decoding the Gurus cover image

Eliezer Yudkowksy: AI is going to kill us all

Decoding the Gurus

00:00

The Verifier Is Broken

"The entire field fails to thrive. And if you, and if you like, give thumbs up to the AI, whenever it can talk a human into agreeing with what it just said about alignment, I am not sure you are training it to output sense," he says. "When the verifier is broken, the more powerful suggestion just learned to explain the flaws in the verifier."

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app