
33 - RLHF Problems with Scott Emmons
AXRP - the AI X-risk Research Podcast
00:00
Discussion on a Research Paper and its Place in AI Risk Research
Exploring a research paper on deception in AI, focusing on the detailed mathematical content in the appendices. The speaker highlights the significance of sharing substantive information on AI risk with technical researchers and discusses the paper's placement within a larger research program on AI alignment and extras.
Transcript
Play full episode