Get the app
Kwasi Ankomah
Lead AI architect at SambaNova Systems specializing in agentic AI and efficient inference; discussed SambaNova's RDU chip architecture, energy-efficient inference, and multi-model serving for real-world production applications.
Best podcasts with Kwasi Ankomah
Ranked by the Snipd community
29 snips
Oct 7, 2025
• 53min
AI Inference: Why Speed Matters More Than You Think (with SambaNova's Kwasi Ankomah)
chevron_right
Kwasi Ankomah, Lead AI Architect at SambaNova Systems, dives into the significance of AI inference and its bottlenecks. He explains how their innovative RDU chip architecture delivers over 700 tokens per second while using 90% less power. The discussion highlights the growing issue of latency with AI agents, emphasizing their increased token demands. Kwasi also explores multi-model serving to optimize costs and performance, and shares insights on the future of open-source models tailored for enterprises.
The AI-powered Podcast Player
Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
Get the app