How AI Is Built

#043 Knowledge Graphs Won't Fix Bad Data

Feb 20, 2025
Juan Sequeda, a Principal Scientist at data.world and an authority on knowledge graphs, shares his insights on improving data quality. He discusses the importance of integrating technical and business metadata to create a 'brain' for AI applications. Sequeda explains how traditional silos hinder effective data management and emphasizes the need for collaboration in startups. He also addresses the balance between automation and human oversight in knowledge graphs and outlines strategies for defining robust entities and relationships, ensuring accurate data connections.
Ask episode
AI Snips
Chapters
Books
Transcript
Episode notes
INSIGHT

Graph Analytics Reveal Hidden Bottlenecks

  • Graph analytics reveal structural priorities like bottlenecks and central nodes in metadata lineage that signal risk and maintenance focus.
  • Juan recommends using graph metrics to find critical tables and prioritize ownership and documentation.
ANECDOTE

Extending Metadata To People And Decisions

  • Juan describes adding employees, models, and business processes into the metadata graph to connect people to data and decisions.
  • He recounts modeling pricing decisions, linking data inputs, people, and decisions to trace business lineage.
INSIGHT

Graph RAG Needs Curated Knowledge

  • Graph RAG and text-extracted knowledge graphs help LLMs by adding explicit relationships beyond embeddings.
  • But automatically built text graphs often lack the curated 'knowledge' layer and need governance to avoid inconsistent entities.
Get the Snipd Podcast app to discover more snips from this episode
Get the app