Evaluation and Testing Processes for Agents in Systems

This chapter explores the critical evaluation and testing processes for agents in a system, highlighting the significance of methodical testing stages. It emphasizes how thorough testing, including edge cases, can improve the reliability of tool and agent updates.

Transcript

chevron_right

Play full episode

chevron_right

Transcript

Episode notes

Francisco Ingham, LLM consultant, NLP developer, and founder of Pampa Labs.Making Your Company LLM-native

// MLOps Podcast #266 with Francisco Ingham, Founder of Pampa Labs.

// Abstract

Being an LLM-native is becoming one of the key differentiators among companies in vastly different verticals. Everyone wants to use LLMs, and everyone wants to be on top of the current tech, but what does it really mean to be LLM-native?

LLM-native involves two ends of a spectrum. On the one hand, we have the product or service that the company offers, which surely offers many automation opportunities. LLMs can be applied strategically to scale at a lower cost and offer a better experience for users.

But being LLM-native not only involves the company's customers, it also involves each stakeholder involved in the company's operations. How can employees integrate LLMs into their daily workflows? How can we, as developers, leverage the advancements in the field not only as builders but as adopters?

We will tackle these and other key questions for anyone looking to capitalize on the LLM wave, prioritizing real results over the hype.

// Bio

Currently working at Pampa Labs, where we help companies become AI-native and build AI-native products. Our expertise lies on the LLM-science side, or how to build a successful data flywheel to leverage user interactions to continuously improve the product. We also spearhead Pampa-friends - the first Spanish-speaking community of AI Engineers.

Previously worked in management consulting, was a TA in fastai in SF, and led the cross-AI + dev tools team at Mercado Libre.

// MLOps Jobs board

jobs.mlops.community

// MLOps Swag/Merch

https://mlops-community.myshopify.com/

// Related Links

Website: pampa.ai

--------------- ✌️Connect With Us ✌️ -------------

Join our Slack community: https://go.mlops.community/slack

Catch all episodes, blogs, newsletters, and more: https://mlops.community/

Connect with Demetrios on LinkedIn: https://www.linkedin.com/in/dpbrinkm/

Connect with Francisco on LinkedIn: https://www.linkedin.com/in/fpingham/