
561: Engineering Data APIs
Super Data Science: ML & AI Podcast with Jon Krohn
Exploration of Knowledge Graphs, Machine Learning Models, and Entity Naming Resolution
Starting from a knowledge graph, this chapter covers manually sampling data to produce labels for an XGBoost model, whose predicted scores are presented on a user-friendly scale. Einblick, a data science platform, is introduced along with its distinguishing features, including a progressive engine and a blend of no-code operations and Python code. The discussion also covers a machine learning method used at Ribbon Health to resolve entities that go by multiple names, combining PCA dimensionality reduction with Gaussian mixture model clustering.
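As a rough illustration of the PCA-plus-Gaussian-mixture idea mentioned in the summary, here is a minimal sketch using scikit-learn. It is not the method used at Ribbon Health, whose details the summary does not give. The sample names, the character n-gram TF-IDF featurization, and the choice of two components are all assumptions made for the example.

```python
from sklearn.decomposition import PCA
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.mixture import GaussianMixture

# Hypothetical name variants; not data from the episode.
names = [
    "St. Mary's Hospital",
    "Saint Marys Hospital",
    "St Mary Hospital",
    "Lakeside Dental Clinic",
    "Lakeside Dental",
    "Lake Side Dental Clinic",
]


def cluster_names(names, n_entities=2, n_dims=2, seed=0):
    """Assign each name a cluster label via TF-IDF -> PCA -> GMM."""
    # Assumed featurization: character n-grams tolerate small spelling changes.
    vectorizer = TfidfVectorizer(analyzer="char_wb", ngram_range=(2, 3))
    features = vectorizer.fit_transform(names).toarray()

    # Reduce the sparse, high-dimensional n-gram space to a few dimensions.
    reduced = PCA(n_components=n_dims, random_state=seed).fit_transform(features)

    # Fit a Gaussian mixture and take the most likely component per name.
    gmm = GaussianMixture(
        n_components=n_entities,
        covariance_type="spherical",
        n_init=5,
        random_state=seed,
    )
    return gmm.fit_predict(reduced)


labels = cluster_names(names)
for name, label in zip(names, labels):
    print(label, name)
```

Names that land in the same component are candidates for being the same entity. In practice the number of components is unknown and would have to be chosen, for example by comparing fits with an information criterion.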
Play episode from 31:00


