"(My understanding of) What Everyone in Technical Alignment is Doing and Why" by Thomas Larsen & Eli Lifland

LessWrong (Curated & Popular)

The Ontology Identifier Problem

Two key properties that might be selected for are modularity and abstractions. Humans tend to use very similar abstractions, even across different cultures and societies. The natural abstraction hypothesis states that a wide variety of cognitive architectures will likely use similar abstractions to reason about the world. If we can understand when environments induce certain abstractions, we can design the environment so that the network has the same abstractions as humans.
