Practical AI

Open source data labeling tools

Nov 5, 2019
Michael Malyuk, CEO of Heartex and a contributor to Label Studio, shares his expertise on the crucial yet challenging process of data labeling in AI. He discusses the significant impact of accurate labeling on model performance and highlights various open-source tools that aim to streamline the process. Malyuk addresses common pitfalls, including time consumption and bias, and explores innovative solutions like Label Studio that enhance collaboration and customization for users. The conversation sheds light on the evolving landscape of data labeling and its critical role in AI development.
Ask episode
AI Snips
Chapters
Books
Transcript
Episode notes
ANECDOTE

Michael's AI Beginnings

  • Michael Malyuk's AI journey began with Lisp programming and reading Peter Norvig's "Paradigms of Artificial Intelligence Programming".
  • This sparked his interest in building production-level AI systems, including math and statistics.
ANECDOTE

Heartex's High-Altitude Origin

  • Michael and his co-founder conceived Heartex during a high-altitude hiking trip in the Himalayas.
  • They recognized data labeling as a significant, unsolved problem in AI development.
INSIGHT

Data Labeling's Core Role

  • Data labeling is the core of any AI product, as it directly impacts model quality.
  • Labeling helps explore datasets, uncover edge cases, and refine understanding of the data.
Get the Snipd Podcast app to discover more snips from this episode
Get the app