Latent Space: The AI Engineer Podcast cover image

The End of Finetuning — with Jeremy Howard of Fast.ai

Latent Space: The AI Engineer Podcast

NOTE

Smooth Pathway to Learning

Exploring the training dynamics of language models reveals the importance of architectural features such as skip connections. Research has shown that architectures incorporating these features lead to smoother optimization landscapes, contrasting the bumpy surfaces found in their absence. Recent advancements, such as a new 3D matrix product visualization from the PyTorch team, exemplify the innovative approaches needed to better understand these complex training processes.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner