AI + a16z

Building Production Workflows for AI Applications

20 snips
Jun 14, 2024
In this podcast, Tony Holdstock-Brown discusses the challenges of running AI workflows in production. He highlights the parallel tracks of CPU and GPU engineering, emphasizing the differences between application-level and mathematical sides. The conversation explores opportunities for improvement in developer tools for generative AI and offers advice for engineers entering the field.
Ask episode
AI Snips
Chapters
Transcript
Episode notes
ANECDOTE

Tony's Early Coding Journey

  • Tony Holdstock-Brown shared how he started coding young, making simple games and joke machines.
  • His early projects transitioned into internet apps for broader usage and feedback.
INSIGHT

Queues Essential for AI Workflows

  • Most applications need queues and event systems for workflows and state management.
  • These complex pipelines are crucial for AI workflows but challenging to manage reliably at scale.
INSIGHT

Fairness and Concurrency Challenges

  • AI workloads require multi-tenant fairness and concurrency due to costly, limited GPU resources.
  • Managing queues in AI apps is complex but essential to avoid poor user experiences and costs.
Get the Snipd Podcast app to discover more snips from this episode
Get the app