Episode 45: Your AI application is broken. Here’s what to do about it.
Feb 20, 2025
Joining the discussion is Hamel Husain, a seasoned ML engineer and open-source contributor, who shares invaluable insights on debugging generative AI systems. He emphasizes that understanding data is key to fixing broken AI applications. Hamel advocates for spreadsheet error analysis over complex dashboards. He also highlights the pitfalls of trusting LLM judges blindly and critiques existing AI dashboard metrics. His practical methods will transform how developers approach model performance and iteration in AI.
Prioritizing data analysis over immediate performance metrics is essential for effectively diagnosing issues within AI applications.
Spreadsheet-based error analysis is a practical approach that enables teams to systematically identify and address common failure modes.
Utilizing synthetic data can help simulate user interactions, allowing teams to preemptively test and refine their AI applications before deployment.
Deep dives
The Importance of Error Analysis
Many teams building AI applications overlook error analysis, leaving them with a murky picture of what actually fails in their systems. Instead of reaching immediately for evaluation libraries, examining failure modes systematically clarifies the root causes of problems. The discussion stresses understanding the data itself rather than reporting statistics with no substance behind them. Treating error analysis as foundational lets teams focus on concrete issues instead of vague metrics.
Effective Analysis Techniques
Spreadsheet-based error analysis is highlighted as a practical, effective way to clarify what is going wrong in an AI application. By compiling interactions into manageable lists, teams can systematically spot recurring errors, such as faulty text summaries or mishandled scheduling requests in a chatbot. Documenting these failures one by one surfaces patterns in the errors, which in turn points to the corrective actions most worth taking.
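As a concrete illustration, here is a minimal sketch of that workflow in Python with pandas. The file names and column names (user_input, model_output, acceptable, failure_mode) are illustrative assumptions, not part of any prescribed tooling; adapt them to whatever your application logs.

```python
# Minimal sketch of spreadsheet-style error analysis, assuming interactions
# have been logged to a CSV with hypothetical columns user_input and model_output.
import pandas as pd

# Load a sample of logged interactions (file name and columns are illustrative).
traces = pd.read_csv("traces_sample.csv")  # columns: user_input, model_output

# Add empty columns for hand labeling; open the result in any spreadsheet tool
# and note, for each row, whether the output is acceptable and how it failed.
traces["acceptable"] = ""    # e.g. "yes" / "no"
traces["failure_mode"] = ""  # free-text note, e.g. "wrong date in scheduling request"

traces.to_csv("traces_to_label.csv", index=False)

# After labeling, tally failure modes to see which problems dominate.
labeled = pd.read_csv("traces_labeled.csv")
print(labeled.loc[labeled["acceptable"] == "no", "failure_mode"].value_counts())
```

The payoff of this low-tech approach is the frequency count at the end: it tells you which failure mode to fix first, something aggregate dashboard scores rarely reveal.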
Utilizing Synthetic Data
When real user data is scarce, generating synthetic data is an effective alternative for getting error analysis started. LLMs can simulate diverse user inputs and interactions, producing test cases for areas suspected of weakness. This kind of simulation lets teams exercise the application under plausible scenarios and surface failure modes before it goes live. The conversation stresses examining applications rigorously with these synthetic scenarios to build confidence in their reliability.
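A minimal sketch of this idea follows, assuming the OpenAI Python client is installed and an API key is configured. The model name, the persona and scenario dimensions, and the prompt wording are all illustrative assumptions rather than a prescribed recipe.

```python
# Minimal sketch of generating synthetic user inputs with an LLM for error analysis.
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

# Illustrative "dimensions" to vary so the synthetic inputs cover different cases.
personas = ["busy executive", "first-time user", "non-native English speaker"]
scenarios = ["reschedule a meeting", "summarize a long email thread", "cancel an appointment"]

synthetic_inputs = []
for persona in personas:
    for scenario in scenarios:
        prompt = (
            f"Write one realistic message that a {persona} might send to an "
            f"AI scheduling assistant when trying to {scenario}. "
            "Return only the message text."
        )
        resp = client.chat.completions.create(
            model="gpt-4o-mini",
            messages=[{"role": "user", "content": prompt}],
        )
        synthetic_inputs.append(resp.choices[0].message.content)

# Run these inputs through your application, then review the outputs with the
# same spreadsheet-based error analysis shown above.
for text in synthetic_inputs:
    print(text)
```

Varying a few explicit dimensions, rather than asking for "random" inputs, keeps the synthetic set diverse and makes it easier to trace which kinds of requests expose weaknesses.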
Iterative Improvement Through Error Insights
Once common failure modes have been identified, teams can improve their applications iteratively, refining prompts and adjusting models based on the insights gained. The idea is to go beyond surface-level fixes and investigate the underlying reasons for the errors encountered. Regular error analysis fosters a continuous improvement loop, strengthening the robustness of AI systems and aligning them more closely with user expectations. This process encourages teams to stay vigilant and proactive in managing their applications and user interactions.
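One way to keep that loop honest is to re-run previously failing cases after each prompt or model change and track how many now pass. The sketch below assumes the labeled spreadsheet from earlier; run_app and looks_fixed are hypothetical placeholders for your application's entry point and your pass/fail check (often a human review or a carefully validated LLM judge), not real library functions.

```python
# Minimal sketch of an iterate-and-recheck loop after a prompt or model change.
import pandas as pd

def run_app(user_input: str) -> str:
    """Placeholder for your AI application (prompt + model + tools)."""
    raise NotImplementedError

def looks_fixed(output: str, failure_mode: str) -> bool:
    """Placeholder check; replace with human review or a trusted judge."""
    raise NotImplementedError

# Reload the hand-labeled spreadsheet and keep only the failing cases.
failures = pd.read_csv("traces_labeled.csv")
failures = failures[failures["acceptable"] == "no"]

results = []
for _, row in failures.iterrows():
    new_output = run_app(row["user_input"])
    results.append(looks_fixed(new_output, row["failure_mode"]))

print(f"Fixed {sum(results)} of {len(results)} previously failing cases")
```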
Building Trust Through Data Review
Trust in AI applications grows through consistent data review and error analysis, not solely through improved model performance. As model capabilities evolve, ongoing scrutiny of outputs against expected results keeps applications at the necessary standard of quality. This cycle of refining applications based on error reviews builds a deeper understanding of user needs and system behaviors, helping teams bridge the gap between technical performance and user satisfaction.
Too many teams are building AI applications without truly understanding why their models fail. Instead of jumping straight to LLM evaluations, dashboards, or vibe checks, how do you actually fix a broken AI app?
In this episode, Hugo speaks with Hamel Husain, longtime ML engineer, open-source contributor, and consultant, about why debugging generative AI systems starts with looking at your data.
In this episode, we dive into:
Why “look at your data” is the best debugging advice no one follows.
How spreadsheet-based error analysis can uncover failure modes faster than complex dashboards.
The role of synthetic data in bootstrapping evaluation.
When to trust LLM judges—and when they’re misleading.
Why AI dashboards that score truthfulness, helpfulness, and conciseness are often a waste of time.
If you're building AI-powered applications, this episode will change how you approach debugging, iteration, and improving model performance in production.