Evaluating Language Models and Theory of Mind

This chapter explores the AI community's reactions to evaluations of language models' cognitive capabilities. Through the lens of the Sally-Anne test, it examines how difficult it is to determine whether these models genuinely reason or merely mimic patterns from their training data.

Play episode from 31:29

