No Priors: Artificial Intelligence | Technology | Startups

Speed will win the AI computing battle with Tuhin Srivastava from Baseten


Mar 21, 2024

Tuhin Srivastava, CEO and co-founder of Baseten, specializes in scalable AI infrastructure. He shares insights on why speed is crucial for AI development, emphasizing the need for efficient code rather than no-code solutions. The conversation highlights surprising use cases for Baseten and discusses the challenges of AI training and deployment in the current landscape. Tuhin also tackles the impact of hardware shortages and the defensibility of jobs in AI, arguing that strategic investment in computing is essential for future success.

INSIGHT

Training vs. Inference Workloads

Training workloads prioritize GPU co-location and network optimization.
Inference workloads prioritize co-location with user activity and repeatable workflows, favoring resiliency and reliability.

ANECDOTE

Baseten's Surprising Growth

Baseten saw unexpected market acceleration in late 2022 and 2023 after a quiet period.
Teams prioritize speed in AI, so more of them choose to buy infrastructure rather than build it.

INSIGHT

Optimizing Inference Performance

Inference optimization involves maximizing GPU usage, scaling across GPUs, and staying up-to-date with open-source advancements.
Baseten's partnership with NVIDIA around TRT-LLM (TensorRT-LLM) has driven performance gains, with a focus on low-level optimizations and open-source contributions.
