2min chapter

MLOps.community  cover image

Large Language Models in Production Round-table Conversation

MLOps.community

CHAPTER

The Importance of Re-Architecting Production

State of the art right now is a couple milliseconds I think actually like there was a paper that talked about state of the art in kind of like what I call in a lab environment was a 29 millisecond and inference pass. If you're just using it for your koi app at you know and on your computer then I wouldn't consider that production so rewinding a little bit we'll go back to you know what is it going to take we're still on the posubgic of latency.

00:00

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode