Latent Space: The AI Engineer Podcast cover image

Llama 2: The New Open LLM SOTA (ft. Nathan Lambert, Matt Bornstein, Anton Troynikov, Russell Kaplan, Whole Mars Catalog et al.)

Latent Space: The AI Engineer Podcast

00:00

Navigating Language Model Training Data

This chapter explores the intricacies of collecting preference data essential for training language models, emphasizing the challenges of sourcing quality data. It discusses the shift towards learning from human feedback, the role of diverse supervised datasets, and the potential impact of uncensored datasets on model performance. Additionally, the conversation highlights the advancements around the Llama model and the collaborative ecosystem that supports its fine-tuning and application in AI development.

Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner
Get the app