How to train your own Large Multimodal Model — with Hugo Laurençon & Leo Tronchon of HuggingFace M4

Latent Space: The AI Engineer Podcast

Challenges in Training Multimodal Models

This chapter covers the difficulties of turning raw HTML into training data for large multimodal models, with a focus on data quality, deduplication, and efficient dataset management. The speakers share how they use synthetic data, scale up training, and deal with problems such as escalating loss and hard-to-debug failures. They also discuss lessons learned from training models iteratively and how sharing knowledge helps improve performance.
