AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
Evaluating Language Models and Theory of Mind
This chapter explores the AI community's reactions to evaluations of language models and their cognitive capabilities. Through the lens of the Sally Ann test, it examines the complexities in understanding whether these models genuinely reason or merely mimic patterns from their training data.