14min chapter

AI Evaluation and Testing: How to Know When Your Product Works (or Doesn’t)

The AI Native Dev - from Copilot today to AI Native Software Development tomorrow

CHAPTER

Evaluating AI: Glean's Approach with LLMs

This chapter explores how Glean uses large language models (LLMs) to evaluate AI systems, particularly in enterprise search across sensitive data. It discusses the challenge of ensuring accurate, reliable AI responses while maintaining customer data privacy. The conversation highlights the importance of user education and of developing evaluation metrics that improve product effectiveness and customer satisfaction.
