
"(My understanding of) What Everyone in Technical Alignment is Doing and Why" by Thomas Larsen & Eli Lifland
LessWrong (Curated & Popular)
00:00
The Ontology Identifier Problem
Two key properties that might be selected for are modularity and abstractions. Humans tend to use really similar abstractions, even across different cultures slash societies. The natural abstraction hypothesis states that a wide variety of cognitive architectures will likely use similar abstractions to reason about the world. We can understand when environments induce certain abstractions, and so we can design this so that the network has the same abstractions as humans.
Play episode from 01:18:03
Transcript


