The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

Closing the Loop Between AI Training and Inference with Lin Qiao - #742

65 snips

Aug 12, 2025

Lin Qiao, CEO and co-founder of Fireworks AI and former AI leader at Meta, shares insights on optimizing the AI development lifecycle. She emphasizes the importance of aligning training and inference systems to minimize deployment friction. Lin discusses the shift from viewing models as commodities to essential product assets and explains reinforcement fine-tuning for leveraging proprietary data. She also tackles the complex challenge of balancing cost, latency, and quality in AI optimization while envisioning a future with closed-loop systems for automated model improvement.

Ask episode

AI Snips

Chapters

Transcript

Episode notes

INSIGHT

Align Experimentation And Production Loops

Fast iteration and production loops must be aligned for velocity and product validation.
Training and inference need cohesive systems to avoid conversion bottlenecks and preserve model quality.

ADVICE

Use One Inference Stack End-To-End

Keep the same inference system for experimentation and large-scale production to enable rapid ramp-up.
Avoid long conversion workflows that block product teams and slow A/B testing.

ANECDOTE

PyTorch Experience Informed Fireworks

Lin draws on her PyTorch experience to validate the need for end-to-end developer platforms.
PyTorch's role in industry influenced Fireworks' design to bridge experimentation and production.

Get the Snipd Podcast app to discover more snips from this episode

Get the app