4min chapter

Data Skeptic cover image

k-means clustering

Data Skeptic

CHAPTER

Unsupervised Learning and Camean's Clustering

Camean's clustering is the poster child for unsupervised learning. The algorithm requires one perameter, which means if your a set has n elements, then there are k to the end, possible labellings you could output. There's no way we can check every possible combination of labellings to find the best one if that n gets too big. For this optimization problem, the best score is the one that minimizes the average distance between your data and the associated centroids.

00:00

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode