5min chapter

MLOps.community  cover image

Cost/Performance Optimization with LLMs [Panel]

MLOps.community

CHAPTER

How to Optimize Your Chatbot

The GPT four is a fine average of one label. That's pretty much the best you're going to get off like and Turk train judges, and it works across fluency accuracy other things. We found that if you ever try to compare two things, the outputs don't work as well. It's better to reverse the problem and say, how can I evaluate what's failing in product which users are getting bad results? The chatbot being rude is kind of the most trivial example of this bus. Yeah, to be direct on that that we're calling this critic modeling and it works extremely well, especially with the most recent models. And then potentially using some like weak supervision signal to

00:00

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode