
The Building Blocks of Agentic Systems with Harrison Chase - #698
The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)
Evolving Evaluation for LLMs
This chapter examines the adaptive evaluation process for machine learning models, especially large language models. It highlights the significance of observability, continual learning, and the development of bespoke metrics and datasets to enhance the evaluation and user experience.
00:00
Transcript
Play full episode
Remember Everything You Learn from Podcasts
Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.