The Gradient: Perspectives on AI

Cameron Jones & Sean Trott: Understanding, Grounding, and Reference in LLMs

Feb 22, 2024
Researchers Cameron Jones and Sean Trott discuss the unexpected capabilities of language models, the challenges of interpreting Turing test results, and the trade-offs that shape lexical ambiguity. They explore the efficiency of language, the internal mechanisms of language models, and how meanings are balanced across word forms. The conversation also covers physical plausibility in language comprehension, theory-of-mind abilities in language models, and critiques of how language models like GPT are evaluated.
INSIGHT

Efficiency Shapes Lexical Ambiguity

  • Lexical ambiguity often reflects efficient reuse of word forms that balances the needs of speakers and comprehenders.
  • Real lexica show that comprehender-oriented pressure keeps ambiguity lower than neutral baselines would predict.
INSIGHT

LLMs Partially Capture Mental States

  • Language models show sensitivity to mental states in the false belief task, but they do not fully account for human behavior.
  • This suggests that linguistic input contributes to theory of mind, but additional resources are needed for fully human-like performance.
INSIGHT

Behavior vs Internal Representations

  • Behavioral similarity alone cannot confirm that humans and LLMs share internal mental representations.
  • Discovering decodable circuits that represent mental states in LLMs would provide strong evidence for similar computational mechanisms.