
k-means clustering

Data Skeptic


Unsupervised Learning and k-means Clustering

k-means clustering is the poster child for unsupervised learning. The algorithm requires one parameter, k, the number of clusters, which means that if your data set has n elements, then there are k to the n possible labellings you could output. There's no way we can check every possible combination of labellings to find the best one if n gets too big. For this optimization problem, the best score is the one that minimizes the average distance between your data and the associated centroids.
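
To make the counting argument concrete, here is a minimal Python sketch that enumerates every one of the k to the n labellings for a tiny, made-up data set and picks the one with the lowest average distance to its centroid. The function names (`centroid`, `average_distance`, `brute_force_best`) and the four sample points are assumptions made for illustration, not anything from the episode. It scores labellings by plain average Euclidean distance, following the wording above; the usual k-means formulation minimizes squared distances instead.

```python
from itertools import product
from math import dist  # Euclidean distance, Python 3.8+


def centroid(points):
    """Coordinate-wise mean of a non-empty list of points."""
    n = len(points)
    return tuple(sum(coords) / n for coords in zip(*points))


def average_distance(data, labels, k):
    """Average distance from each point to the centroid of its cluster."""
    total = 0.0
    for cluster in range(k):
        members = [p for p, lab in zip(data, labels) if lab == cluster]
        if not members:
            continue  # an unused label contributes nothing
        c = centroid(members)
        total += sum(dist(p, c) for p in members)
    return total / len(data)


def brute_force_best(data, k):
    """Score all k**n labellings; only feasible for very small n."""
    labelings = list(product(range(k), repeat=len(data)))
    best = min(labelings, key=lambda labels: average_distance(data, labels, k))
    return len(labelings), best, average_distance(data, best, k)


# Hypothetical sample data: two obvious groups of two points each.
data = [(0.0, 0.0), (0.0, 1.0), (10.0, 10.0), (10.0, 11.0)]
count, best_labels, best_score = brute_force_best(data, k=2)
print(count, best_labels, best_score)
```

With n = 4 and k = 2 there are only 16 labellings to score, but at n = 100 and k = 2 there would already be 2 to the 100, which is why exhaustive search stops being an option as n grows.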
