In this podcast, Tony Holdstock-Brown discusses the challenges of running AI workflows in production. He highlights the two parallel tracks of CPU and GPU engineering, emphasizing how different the application-level and mathematical sides of the field have become. The conversation also explores opportunities for improvement in developer tools for generative AI and offers advice for engineers entering the field.
Engineers must navigate the differences between the CPU and GPU engineering tracks when developing AI applications.
AI tools help developers streamline workflows and boost productivity in production environments.
Deep dives
Evolution of Developer Tools Post-AI
In the post-AI era, user expectations have risen, pushing engineers to focus on higher-level product problems rather than tedious infrastructure work such as queuing, retry mechanisms, and state management. Developer tools, infrastructure tools, and APIs exist to make developers more effective at solving those problems, and AI itself now plays a key role in improving efficiency. AI-aware tooling and pipelines streamline everything from fair GPU resource allocation to using generative AI inside transactional applications, as the sketch below illustrates.
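For context, this is the kind of retry plumbing teams historically hand-rolled before durable-workflow tooling absorbed it. A minimal sketch, not from the episode; withRetries is an illustrative helper:

```typescript
// Hand-rolled retry-with-backoff plumbing: the tedious infrastructure work
// that workflow tools now handle declaratively.
async function withRetries<T>(
  fn: () => Promise<T>,
  maxAttempts = 4
): Promise<T> {
  let lastError: unknown;
  for (let attempt = 0; attempt < maxAttempts; attempt++) {
    try {
      return await fn();
    } catch (err) {
      lastError = err;
      // Exponential backoff: wait 1s, 2s, 4s, ... between attempts.
      await new Promise((resolve) => setTimeout(resolve, 2 ** attempt * 1000));
    }
  }
  throw lastError;
}

// Usage: wrap any flaky call, e.g. an LLM API request.
// const result = await withRetries(() => callSomeFlakyApi());
```

Multiply this by queues, dead-letter handling, and persisted state, and it becomes clear why teams prefer to declare these behaviors rather than build them.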
Challenges in Running AI Applications in Production
Running AI applications, including LLM-based agents, in production means addressing challenges such as guaranteeing workflow execution, allocating GPU resources fairly, and leveraging generative AI to create workflows. Developers face real complexity in managing state and executing tasks reliably across diverse applications. The demand for reliable pipelines without infrastructure headaches drives tools like Inngest, whose SDK lets developers write resilient code without intricate infrastructure setup; a sketch of the pattern follows.
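Here's a minimal sketch of that pattern in the style of Inngest's TypeScript SDK. Exact option names can differ between SDK versions, and extractText and callLlm are hypothetical helpers standing in for real implementations:

```typescript
import { Inngest } from "inngest";

const inngest = new Inngest({ id: "my-app" });

// Hypothetical helpers standing in for real implementations.
declare function extractText(url: string): Promise<string>;
declare function callLlm(text: string): Promise<string>;

// Each step.run() call is checkpointed: its result is persisted and the step
// retried independently, so a crash mid-workflow resumes from the last
// completed step instead of starting over.
export const summarizeDocument = inngest.createFunction(
  { id: "summarize-document", retries: 3 },
  { event: "app/document.uploaded" },
  async ({ event, step }) => {
    const text = await step.run("extract-text", () =>
      extractText(event.data.url)
    );
    const summary = await step.run("summarize", () => callLlm(text));
    return { summary };
  }
);
```

The appeal is that retries, state, and resumability live in the function definition rather than in bespoke queue and worker infrastructure.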
Role of Infrastructure in AI Workloads
Infrastructure plays a crucial role in supporting AI workloads, especially under constraints like limited GPU capacity and high costs. Managing multi-tenant fairness, concurrency, and complex pipelines is vital for deploying AI applications efficiently: when capacity is scarce, one tenant's burst of work can starve everyone else, so fair scheduling and high resource utilization become first-order concerns for reliable workflow orchestration. One way that constraint shows up in code is sketched below.
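A hedged sketch of per-tenant fairness on a GPU-bound function, again in the style of Inngest's TypeScript SDK; option names may vary by version, and accountId and renderOnGpu are hypothetical:

```typescript
import { Inngest } from "inngest";

const inngest = new Inngest({ id: "my-app" });

// Hypothetical GPU-backed rendering call.
declare function renderOnGpu(prompt: string): Promise<string>;

// Cap each tenant at two concurrent runs, keyed on a hypothetical accountId
// field, so a single heavy user cannot monopolize scarce GPU capacity while
// other tenants' jobs keep flowing.
export const generateImage = inngest.createFunction(
  {
    id: "generate-image",
    concurrency: { limit: 2, key: "event.data.accountId" },
  },
  { event: "app/image.requested" },
  async ({ event, step }) => {
    return step.run("render", () => renderOnGpu(event.data.prompt));
  }
);
```

Keying the concurrency limit per account, rather than setting one global limit, is what turns a simple throttle into multi-tenant fairness.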
Impact of AI on Development and Production Processes
AI tools give developers opportunities to streamline workflows and enhance productivity, but productionizing AI models effectively remains hard. Balancing ease of exploration against robust production readiness is a persistent tension. Using AI within the infrastructure layer itself requires meticulous testing, monitoring, and safety considerations to deploy AI applications successfully and efficiently.
In this episode, Inngest cofounder and CEO Tony Holdstock-Brown joins a16z partner Yoko Li, as well as Derrick Harris, to discuss the reality and complexity of running AI agents and other multistep AI workflows in production. Tony also explains why developer tools for generative AI — and their founders — might look very similar to previous generations of these products, and where there are opportunities for improvement.
Here's a sample of the discussion, where Tony shares some advice for engineers looking to build for AI:
"We almost have two parallel tracks right now as, as engineers. We've got the CPU track in which we're all like, 'Oh yeah, CPU-bound, big O notation. What are we doing on the application-level side?' And then we've got the GPU side, in which people are doing like crazy things in order to make numbers faster, in order to make differentiation better and smoother, in order to do gradient descent in a nicer and more powerful way. The two disciplines right now are working together, but are also very, very, very different from an engineering point of view.
"This is one interesting part to think about for like new engineers, people that are just thinking about what to do if they want to go into the engineering field overall. Do you want to be on the side using AI, in which you take all of these models, do all of this stuff, build the application-level stuff, and chain things together to build products? Or do you want to be on the math side of things, in which you do really low-level things in order to make compilers work better, so that your AI things can run faster and more efficiently? Both are engineering, just completely different applications of it."