#245 Can We Make Generative AI Cheaper? With Natalia Vassilieva, Senior VP & Field CTO & Andy Hock, VP, Product & Strategy at Cerebras Systems

DataFramed

Optimizing Generative AI Deployment

This chapter explores the complexities of efficiently building and deploying generative AI models, focusing on the critical choice of hardware for inference. It discusses trade-offs between latency and throughput in applications like chatbots and drug discovery, while also speculating on future terminology for large language models.
