
How Language Models Actually Think
The Data Exchange with Ben Lorica
00:00
Mechanisms Behind Hallucinations
Emmanuel details neural mechanisms where 'I know' detectors trigger confident but incorrect answers.
Play episode from 05:39
Transcript

Emmanuel details neural mechanisms where 'I know' detectors trigger confident but incorrect answers.