The Nonlinear Library

AF - Different senses in which two AIs can be "the same" by Vivek Hebbar

Jun 24, 2024
AI Alignment researcher and author, Vivek Hebbar, explores the different senses in which two AIs can be considered the same or different. He discusses distinctions such as model weights, pretrained identity, shared context, shared activations, shared memory, shared reward, and shared role in training, and how these impact collusion and coordination in AI safety.
Ask episode
Chapters
Transcript
Episode notes