35 - Peter Hase on LLM Beliefs and Easy-to-Hard Generalization

AXRP - the AI X-risk Research Podcast

NOTE

Language Models Reflect Reality

The working assumption is that language models understand the user's question and try to answer it truthfully. On this view, they represent aspects of the world and act as competent communicators across contexts. One way to evaluate their responses is to compare the probability a model assigns to "yes" with the probability it assigns to "no" on a yes/no question, treating that output as a reflection of the model's underlying beliefs and its representation of reality.
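As a rough illustration of the yes/no probability idea, the sketch below turns two log-probabilities into a single number between 0 and 1. It is a minimal sketch under stated assumptions, not the method discussed in the episode: the function name `yes_probability` and the example log-probabilities are hypothetical, and it assumes you have already obtained the model's log-probabilities for the answers "yes" and "no" from whatever model interface you use.

```python
import math


def yes_probability(logp_yes: float, logp_no: float) -> float:
    """Renormalize over the two answers: P(yes) / (P(yes) + P(no)).

    Inputs are log-probabilities the model assigns to "yes" and "no".
    Subtracting the max before exponentiating avoids underflow.
    """
    m = max(logp_yes, logp_no)
    e_yes = math.exp(logp_yes - m)
    e_no = math.exp(logp_no - m)
    return e_yes / (e_yes + e_no)


# Hypothetical numbers for illustration only (not from any real model):
# the model puts 0.6 on "yes", 0.2 on "no", and the rest on other tokens.
belief = yes_probability(math.log(0.6), math.log(0.2))
print(round(belief, 2))
```

A value near 1 would be read as the model leaning toward "yes", and a value near 0 as leaning toward "no". Renormalizing over just the two answers ignores probability mass on other tokens, which is a simplifying choice of this sketch.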
