AXRP - the AI X-risk Research Podcast cover image

13 - First Principles of AGI Safety with Richard Ngo

AXRP - the AI X-risk Research Podcast

00:00

Transparency Research

Transparency is, in some sense, one of the corps underlying drivers of many proposals for allignment. The other cors driver here is just using more human data to nudge systems towards fulfilling human preference as better. A lot of research atgenders or a, assuming a certain amount of interpretability or transparent transparency. Is this important part of a aglinment?

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app