Val Bercovici
Longtime storage and infrastructure expert and Chief AI Officer at Weka, focused on AI product strategy, AI-enabled storage, and disaggregated prefill/decode for inference.
Best podcasts with Val Bercovici
Ranked by the Snipd community
Jul 7, 2025 • 47 min
A Conversation with Val Bercovici about Disaggregated Prefill / Decode
Val Bercovici, Chief AI Officer at Weka and a storage infrastructure veteran, discusses disaggregated prefill and decode for AI workloads. He explains how inference has become the center of AI spending and the challenges that shift creates. Splitting prefill from decode, which he likens to an assembly line, improves GPU utilization and efficiency. He also covers Weka's work on software-defined memory, which is aimed at improving memory management and scalability in AI-heavy environments.