
Episode 26: Developing and Training LLMs From Scratch

Vanishing Gradients

00:00

Implementing Deep Learning Models from Scratch

This chapter discusses why it matters to understand core concepts such as backpropagation and linear algebra, and to implement deep learning models from scratch using tools like PyTorch. It stresses grasping the underlying mechanisms while still relying on higher-level tools such as PyTorch Lightning, scikit-learn, and Keras for deep learning work. The conversation also covers the challenges of working with large language models, options for training on platforms such as Google Colab or Lightning AI, and insights into creating open-source projects such as Lit GPT.
