AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
The Gradient Podcast - Part 2
We're pretty optimistic that it has at least some bearing on large language models. Even if you do want to train these models, models at this scale can easily be trained on a single GPU and relatively small data sets. So for us, without small supercomputers to run GP3, does that seem like it'll help? That was really interesting for me. We need these papers.