Vanishing Gradients cover image

Vanishing Gradients

Episode 26: Developing and Training LLMs From Scratch

May 15, 2024
Sebastian Raschka discusses developing and training large language models (LLMs) from scratch, covering topics like prompt engineering, fine-tuning, and RAG systems. They explore the skills, resources, and hardware needed, the lifecycle of LLMs, live coding to create a spam classifier, and the importance of hands-on experience. They also touch on using PyTorch Lightning and fabric for managing large models, and reveal insights on techniques in natural language processing models and evaluating LLMs for classification problems.
01:51:35

Podcast summary created with Snipd AI

Quick takeaways

  • In building Large Language Models (LLMs), understanding the input-output dynamics is crucial for coding the data input process.
  • Engineering LLMs involves pre-training on broad data sets followed by fine-tuning for specific tasks to enhance model capabilities.

Deep dives

Building LLMs: Understanding the LLM Life Cycle from Scratch

To delve into the world of building Large Language Models (LLMs), it's essential to start by understanding the entire life cycle of an LLM from the ground up. Beginning with the fundamental concept of feeding data into the model, you lay the foundation for comprehending the input-output dynamics of LLMs. This first step involves coding the data input process before diving into the complexities of the LLM architecture.

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode