
35 - Peter Hase on LLM Beliefs and Easy-to-Hard Generalization
AXRP - the AI X-risk Research Podcast
00:00
Challenging Assumptions in Model Knowledge Editing
This chapter examines a research paper that challenges traditional views on knowledge storage in neural network layers and its impact on model editing methods. It reveals unexpected findings that suggest a more intricate relationship between layer knowledge and editing efficacy than previously believed.
Transcript
Play full episode