LessWrong (Curated & Popular)

AI Control: Improving Safety Despite Intentional Subversion

Dec 15, 2023
Ask episode
Chapters
Transcript
Episode notes