Vanishing Gradients cover image

Episode 26: Developing and Training LLMs From Scratch

Vanishing Gradients

00:00

Evolution of Language Model Techniques

Exploring the evolution of language model techniques from foundational concepts to cutting-edge methods like Laura and QLORA. The chapter delves into innovative architectural concepts, advancements in language model training, Decision-Points Optimization strategy, sparse attention mechanisms, fine-tuning GPT-2 for text classification, and the process of fine-tuning a model through iterative parameter updates.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app