MLOps.community  cover image

Cost/Performance Optimization with LLMs [Panel]

MLOps.community

00:00

The Importance of Scalability in Semantic Search

A lot of our work we're moving like for some of our semantic search. These are small query encoders is a 30 million 40 million a hundred million parameter models. So it sits in the same system that's actually doing the retrieval. And so that can just simplify production. It's just a question on really like figuring out what scale you're going to go to.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app