LessWrong (Curated & Popular) cover image

Critical review of Christiano’s disagreements with Yudkowsky

LessWrong (Curated & Popular)

00:00

Unknown Unknowns of Deep Learning and the Safety of Alignment Protocols

This chapter explores the unknowns of deep learning's generalization properties, discussing the lack of understanding in conditions for good generalization and the potential risks of misgeneralization.

Play episode from 14:41
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app