Tool Use - AI Conversations

The Blueprint For AI Agents That Work (ft Diamond Bishop)

Jun 24, 2025
Diamond Bishop, Director of Engineering and AI at Datadog, shares his experience building self-improving AI agents. He discusses strategies for establishing robust evaluation systems, fostering user trust, and choosing between prompt engineering and fine-tuning. He also covers when to use AI agents versus traditional scripts, the importance of dataset management, and the future of ambient AI in DevSecOps.
ANECDOTE

Weekly AI Updates with LM Notebook

  • Diamond writes a weekly AI update newsletter using LM Notebook.
  • This practice helps him stay informed and share curated insights internally and externally.
ADVICE

Manage Custom Prompt Risks

  • Log user customizations and evaluate their impact using your eval suite.
  • Provide guardrails or warnings when a custom prompt hurts performance, but still let users make the choice.
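The two points above can be sketched in a few lines of Python. This is a minimal illustration, not the approach described in the episode: `EvalCase`, `run_agent`, `check_custom_prompt`, exact-match scoring, and the `max_drop` threshold are all hypothetical names and choices made for the example.

```python
from dataclasses import dataclass
from typing import Callable, Sequence


@dataclass
class EvalCase:
    input: str
    expected: str


# Hypothetical agent interface: (system_prompt, user_input) -> output text.
Agent = Callable[[str, str], str]


def score_prompt(prompt: str, cases: Sequence[EvalCase], run_agent: Agent) -> float:
    """Return the fraction of eval cases answered correctly under `prompt`."""
    if not cases:
        raise ValueError("eval suite is empty")
    # Exact match is an assumption; a real suite may use a richer grader.
    passed = sum(run_agent(prompt, c.input) == c.expected for c in cases)
    return passed / len(cases)


def check_custom_prompt(
    base_prompt: str,
    custom_prompt: str,
    cases: Sequence[EvalCase],
    run_agent: Agent,
    max_drop: float = 0.05,  # assumed tolerance before warning the user
) -> dict:
    """Score both prompts and return a record suitable for logging."""
    base = score_prompt(base_prompt, cases, run_agent)
    custom = score_prompt(custom_prompt, cases, run_agent)
    return {
        "custom_prompt": custom_prompt,
        "base_score": base,
        "custom_score": custom,
        # Warn only; the caller still decides whether to apply the prompt.
        "warn": (base - custom) > max_drop,
    }
```

The function only returns a record with a `warn` flag rather than rejecting the prompt, which keeps the final choice with the user.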
ADVICE

Synthetic Data to Address Imbalance

  • Use synthetic data to augment rare or underrepresented cases in eval datasets.
  • When the eval dataset is already balanced and matches real usage, production data alone may suffice.
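As a rough sketch of the first point, the Python below tops up underrepresented labels in an eval dataset with generated examples. The dataset shape (dicts with a `label` key), the `make_synthetic` generator, and the `min_per_label` target are assumptions for illustration; in practice the generator might be an LLM call or a template.

```python
from collections import Counter
from typing import Callable


def augment_rare_cases(
    dataset: list[dict],
    make_synthetic: Callable[[str, int], dict],  # hypothetical generator
    min_per_label: int,
) -> list[dict]:
    """Return a new dataset where every label has at least `min_per_label` examples."""
    counts = Counter(example["label"] for example in dataset)
    augmented = list(dataset)  # copy so the production data stays untouched
    for label, count in counts.items():
        for i in range(max(0, min_per_label - count)):
            example = make_synthetic(label, i)
            # Tag synthetic rows so they can be filtered or reported separately.
            augmented.append({**example, "label": label, "synthetic": True})
    return augmented
```

Tagging synthetic rows makes it possible to report scores on production-only and augmented data side by side.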