
🏃♀️Moving Fast and Breaking Data with Shreya Shankar
The MLOps Podcast
The Challenges of Automating Clustering
The paper is out, so if you want to read it and then try to implement it, or want to write some code for it, it's not too hard. You just summarize each partition. The choice of summary statistics matters, but we list those in the paper. You cluster the summary statistics, so you identify correlated features. And then you just run nearest neighbors on your clusters. If a cluster is found anomalous, then the features in that cluster are your broken features.
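The three steps described here (summarize each partition, cluster the summaries to group correlated features, then run nearest neighbors per cluster) can be sketched roughly as follows. This is a minimal illustration, not the paper's implementation: the function names, the use of the mean as the only summary statistic, the greedy correlation grouping, and the `k`, `factor`, and `threshold` values are all assumptions made for the example.

```python
import math
import statistics


def summarize(partition):
    """Reduce one partition {feature: [values]} to {feature: mean}.

    Assumption: the mean stands in for the summary statistics
    that the paper actually lists.
    """
    return {name: statistics.fmean(vals) for name, vals in partition.items()}


def pearson(xs, ys):
    """Pearson correlation of two equal-length series (0.0 if either is constant)."""
    mx, my = statistics.fmean(xs), statistics.fmean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    norm = math.sqrt(sum((x - mx) ** 2 for x in xs) * sum((y - my) ** 2 for y in ys))
    return cov / norm if norm else 0.0


def cluster_features(history, threshold=0.8):
    """Greedily group features whose summary series are strongly correlated.

    Assumption: a simple stand-in for whatever clustering the paper uses.
    """
    features = sorted(history[0])
    series = {f: [s[f] for s in history] for f in features}
    clusters = []
    for f in features:
        for cluster in clusters:
            if abs(pearson(series[f], series[cluster[0]])) >= threshold:
                cluster.append(f)
                break
        else:
            clusters.append([f])
    return clusters


def knn_distance(point, others, k):
    """Mean Euclidean distance from point to its k nearest neighbors in others."""
    dists = sorted(math.dist(point, o) for o in others)
    return statistics.fmean(dists[:k])


def broken_features(history, new_summary, k=3, factor=3.0, threshold=0.8):
    """Return the features of every cluster whose new summary looks anomalous."""
    broken = []
    for cluster in cluster_features(history, threshold):
        past = [[s[f] for f in cluster] for s in history]
        # Baseline: how far each historical partition sits from the others.
        baseline = [
            knn_distance(p, past[:i] + past[i + 1:], k) for i, p in enumerate(past)
        ]
        score = knn_distance([new_summary[f] for f in cluster], past, k)
        if score > factor * max(baseline):
            broken.extend(cluster)
    return broken
```

Here `history` is a list of per-partition summaries (for example, one per day) and `new_summary` is the summary of the partition being checked. Because the check runs per cluster, every feature in an anomalous cluster is reported together, which matches the description above. The sketch does not rescale features, so a real version would likely normalize the summaries before computing distances.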