AXRP - the AI X-risk Research Podcast cover image

33 - RLHF Problems with Scott Emmons

AXRP - the AI X-risk Research Podcast

00:00

Discussion on a Research Paper and its Place in AI Risk Research

Exploring a research paper on deception in AI, focusing on the detailed mathematical content in the appendices. The speaker highlights the significance of sharing substantive information on AI risk with technical researchers and discusses the paper's placement within a larger research program on AI alignment and extras.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app