The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

Closing the Loop Between AI Training and Inference with Lin Qiao - #742

54 snips
Aug 12, 2025
Lin Qiao, CEO and co-founder of Fireworks AI and former AI leader at Meta, shares insights on optimizing the AI development lifecycle. She emphasizes the importance of aligning training and inference systems to minimize deployment friction. Lin discusses the shift from viewing models as commodities to essential product assets and explains reinforcement fine-tuning for leveraging proprietary data. She also tackles the complex challenge of balancing cost, latency, and quality in AI optimization while envisioning a future with closed-loop systems for automated model improvement.
Ask episode
AI Snips
Chapters
Transcript
Episode notes
INSIGHT

Align Experimentation And Production Loops

  • Fast iteration and production loops must be aligned for velocity and product validation.
  • Training and inference need cohesive systems to avoid conversion bottlenecks and preserve model quality.
ADVICE

Use One Inference Stack End-To-End

  • Keep the same inference system for experimentation and large-scale production to enable rapid ramp-up.
  • Avoid long conversion workflows that block product teams and slow A/B testing.
ANECDOTE

PyTorch Experience Informed Fireworks

  • Lin draws on her PyTorch experience to validate the need for end-to-end developer platforms.
  • PyTorch's role in industry influenced Fireworks' design to bridge experimentation and production.
Get the Snipd Podcast app to discover more snips from this episode
Get the app