Catherine Olsson and Nelson Elhage: Anthropic, Understanding Transformers

The Gradient: Perspectives on AI

The Gradient Podcast - Part 2

We're pretty optimistic that it has at least some bearing on large language models. Even if you do want to train these models, models at this scale can easily be trained on a single GPU and relatively small data sets. So for those of us without small supercomputers to run GPT-3, does that seem like it'll help? That was really interesting for me. We need these papers.
