Latent Space: The AI Engineer Podcast cover image

Latent Space: The AI Engineer Podcast

How to train a Million Context LLM — with Mark Huang of Gradient.ai

May 30, 2024
Mark Huang, Co-founder of Gradient.ai, dives into the exciting advancements in AI, particularly long context learning. He discusses the evolution of context lengths, mapping out a timeline of breakthroughs and innovations in LLMs. Mark reflects on his team's work with Llama 3 and the challenges of training models with vast token capacities. He also sheds light on optimizing GPU performance and the pressing need for high-quality data in model training. Their vision is creating flexible AI solutions that truly adapt to enterprise workflows.
57:30

Episode guests

Podcast summary created with Snipd AI

Quick takeaways

  • Transitioning from finance to tech to leverage large-scale machine learning at Gradient.ai.
  • Challenges in handling exploding gradients, precision issues, and variable state tracking in AI models.

Deep dives

Building a Full Stack AI Platform at Gradient

Mark Wang shares the journey of transitioning from the quant finance world to tech, where he worked at Box and Splunk before founding Gradient. The shift aimed to leverage large-scale machine learning and collaboration beyond a bubble, leading to the creation of Gradient to maximize AI contributions to businesses.

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner