Ep8. AI Models, Data Scaling, Enterprise & Personal AI | BG2 with Bill Gurley & Brad Gerstner

BG2Pod with Brad Gerstner and Bill Gurley

NOTE

Maximizing Model Capabilities Through Continuous Training

The biggest impact of Llama 3 lies in its capabilities, which Meta achieved by training the model well past the Chinchilla point (the compute-optimal ratio of training tokens to model parameters) so that it retains as much information and capability as possible. By continuing to curate and refine the data and run more forward passes over it, Meta packed far more capability into the model from the same dataset. This continuous training approach exceeded expectations: Llama 3 was still learning when it was taken offline so that resources could be reallocated to Llama 4. The 15 trillion tokens used to train Llama 3 show how quickly the field is moving and how far Meta is willing to push model training and capabilities.
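To give a rough sense of what "past the Chinchilla point" means in numbers, the sketch below compares the 15 trillion tokens mentioned in the note against a compute-optimal token budget. Two things here are assumptions, not statements from the episode: the common rule of thumb of about 20 training tokens per parameter, and the 8B and 70B model sizes used as examples. The function names are illustrative only.

```python
# Rough arithmetic sketch, not Meta's actual methodology.
# ASSUMPTION: the "Chinchilla point" is approximated by the widely quoted
# rule of thumb of ~20 training tokens per model parameter.
TOKENS_PER_PARAM = 20

# From the note: Llama 3 was trained on 15 trillion tokens.
LLAMA3_TOKENS = 15e12

# ASSUMPTION: example model sizes (parameter counts), not given in the note.
MODEL_SIZES = {"8B": 8e9, "70B": 70e9}


def chinchilla_tokens(n_params: float) -> float:
    """Approximate compute-optimal token budget for a model of n_params."""
    return TOKENS_PER_PARAM * n_params


def overtraining_ratio(n_params: float, tokens_trained: float) -> float:
    """How many times past the approximate Chinchilla point training went."""
    return tokens_trained / chinchilla_tokens(n_params)


if __name__ == "__main__":
    for name, n_params in MODEL_SIZES.items():
        optimal = chinchilla_tokens(n_params)
        ratio = overtraining_ratio(n_params, LLAMA3_TOKENS)
        print(f"{name}: ~{optimal:.2e} tokens at the Chinchilla point, "
              f"trained on ~{ratio:.1f}x that amount")
```

Under these assumptions, the smaller the model, the further 15 trillion tokens sits beyond its compute-optimal budget, which is one way to read the claim that extra training kept adding capability.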
