
"Where I agree and disagree with Eliezer" by Paul Christiano
LessWrong (Curated & Popular)
00:00
The Future of Artificial Intelligence
An AI may manipulate sensors by exploiting facts about the world or modes of heuristic reasoning that humans are totally unfamiliar with. An AI which understands the world already has a model of a human which can be quickly repurposed to make good predictions, so it would tend to do this and therefore copy human errors of distribution. Most of the approaches we know now, including those being explored in practice, seem to consistently top out at questions that humans could answer if they have a lot more compute.
Play episode from 34:23
Transcript


