
k-means clustering
Data Skeptic
00:00
The Problem of Gaussian Blobs of Data
The most popular approach to solvenca means clustering. If your data set fits this format, you're in luck. Algorithms like camines are built for data that's gausian blobs. In the world, data sets are rarely so well distributed. A few of those data points with a partitioning other than what your instinct says can be misleading.
Transcript
Play full episode