#245 Can We Make Generative AI Cheaper? With Natalia Vassilieva, Senior VP & Field CTO & Andy Hock, VP, Product & Strategy at Cerebras Systems

DataFramed

CHAPTER

Optimizing Generative AI Deployment

This chapter explores the complexities of efficiently building and deploying generative AI models, focusing on the critical choice of hardware for inference. It discusses trade-offs between latency and throughput in applications like chatbots and drug discovery, while also speculating on future terminology for large language models.

