The Data Stack Show cover image

205: How to make LLMs Boring (Predictable, Reliable, and Safe), Featuring Nicolay Gerold

The Data Stack Show

00:00

Monitoring and Testing in AI

This chapter explores the essential practices of monitoring and testing in the development of Large Language Models and generative AI. It underscores the importance of establishing thorough monitoring systems and quantifiable testing methods to ensure the reliability and effectiveness of AI applications, highlighting the risks of inadequate validation.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app