
13 - First Principles of AGI Safety with Richard Ngo
AXRP - the AI X-risk Research Podcast
00:00
Transparency Research
Transparency is, in some sense, one of the corps underlying drivers of many proposals for allignment. The other cors driver here is just using more human data to nudge systems towards fulfilling human preference as better. A lot of research atgenders or a, assuming a certain amount of interpretability or transparent transparency. Is this important part of a aglinment?
Transcript
Play full episode