
35 - Peter Hase on LLM Beliefs and Easy-to-Hard Generalization
AXRP - the AI X-risk Research Podcast
00:00
Intro
This chapter covers the speaker's path into natural language processing, including their academic interest in language and machine learning. It also describes their early hands-on experience with language models and how their research came to focus on the interpretability of these systems.