
"Discussion with Nate Soares on a key alignment difficulty" by Holden Karnofsky
LessWrong (Curated & Popular)
00:00
Introduction
This is an audio version of "Discussion with Nate Soares on a key alignment difficulty" by Holden Karnofsky, published on the 14th of March, 2023. Cross-posted from the AI Alignment Forum; may contain more technical jargon than usual. In late 2022, Nate gave some feedback on my Cold Takes series on AI risk, shared as drafts at that point. I wanted to understand the difficulty he was pointing to, so the two of us had an extended Slack exchange. I then wrote up a summary of the exchange that we iterated on until we were both reasonably happy with its characterization of the difficulty and our disagreement. We probably spent more time on the summary than on the exchange itself.
Transcript


