Introduction

A conversation with guest Sam Bean on the importance of simplicity and of understanding your metrics when building LLM systems. The discussion offers insights on avoiding the trap of unnecessary complexity and on making effective progress.

Episode notes

Thank you to Zilliz, the wonderful sponsor of this episode. Create some amazing stuff with Zilliz RAG: https://zilliz.com/vector-database-use-cases/llm-retrieval-augmented-generation

Sam Bean is a seasoned AI and machine learning expert, specializing in Large Language Models (LLMs) and search tech.

With a computer science background and a drive for innovation, Sam leads the team at Rewind AI in leveraging advanced tech to tackle complex challenges.

MLOps podcast #217 with Sam Bean, Software Engineer (Applied AI) at Rewind.ai: The Real E2E RAG Stack.

// Abstract

What does a fully operational LLM + Search stack look like when you're running your own retrieval and inference infrastructure? What does the flywheel really mean for RAG applications? How do you maintain the quality of your responses? How do you prune/dedupe documents to maintain your document quality?

// Bio

Sam has been training, evaluating, and deploying production-grade inference solutions for language models for the past 2 years at You.com. Prior to that, he built personalization algorithms at StockX.

// MLOps Jobs board

jobs.mlops.community

// MLOps Swag/Merch

https://mlops-community.myshopify.com/

// Related Links

Website: https://github.com/sam-h-bean/

Reinforced Self-Training (ReST) - https://arxiv.org/pdf/2308.08998.pdf

ReST meets ReAct - https://arxiv.org/pdf/2312.10003.pdf

--------------- ✌️Connect With Us ✌️ -------------

Join our Slack community: https://go.mlops.community/slack

Catch all episodes, blogs, newsletters, and more: https://mlops.community/

Connect with Demetrios on LinkedIn: https://www.linkedin.com/in/dpbrinkm/