

Building Production Workflows for AI Applications
20 snips Jun 14, 2024
In this podcast, Tony Holdstock-Brown discusses the challenges of running AI workflows in production. He highlights the parallel tracks of CPU and GPU engineering, emphasizing the differences between application-level and mathematical sides. The conversation explores opportunities for improvement in developer tools for generative AI and offers advice for engineers entering the field.
AI Snips
Chapters
Transcript
Episode notes
Tony's Early Coding Journey
- Tony Holdstock-Brown shared how he started coding young, making simple games and joke machines.
- His early projects transitioned into internet apps for broader usage and feedback.
Queues Essential for AI Workflows
- Most applications need queues and event systems for workflows and state management.
- These complex pipelines are crucial for AI workflows but challenging to manage reliably at scale.
Fairness and Concurrency Challenges
- AI workloads require multi-tenant fairness and concurrency due to costly, limited GPU resources.
- Managing queues in AI apps is complex but essential to avoid poor user experiences and costs.