ToKCast cover image

Ep 255: Does this research explain how LLMs work?

ToKCast

00:00

Overview of the Three Papers

Brett overviews paper one (Bayesian wind tunnel), paper two (attention as manifold shaping), and paper three (scaling tests).

Play episode from 53:29
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app