The Busy Person's Intro to Finetuning & Open Source AI - Wing Lian, Axolotl

Latent Space: The AI Engineer Podcast

Advanced AI Training Techniques

This chapter covers sample-packing techniques for training transformer models, focusing on Hugging Face's StackLLaMA and on Multipack. It also introduces newer architectures such as Mamba, and discusses gains in model performance and efficiency and their impact on the open-source AI community.
