Vanishing Gradients cover image

Episode 26: Developing and Training LLMs From Scratch

Vanishing Gradients

00:00

Evolution of Language Model Techniques

Exploring the evolution of language model techniques from foundational concepts to cutting-edge methods like Laura and QLORA. The chapter delves into innovative architectural concepts, advancements in language model training, Decision-Points Optimization strategy, sparse attention mechanisms, fine-tuning GPT-2 for text classification, and the process of fine-tuning a model through iterative parameter updates.

Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner
Get the app