Data Skeptic cover image

k-means clustering

Data Skeptic

00:00

The Importance of Units Matters in a Machine Learning Optimization Problem

If you blindly ask a machine learning algorithm to solve an optimization problem on data like this, it will absolutely exploit the unnormalized nature of your data. Most software implementations of camen's clustering will automatically apply normalization of your data in a standard statistical way. The vast majority of listeners will never encounter the scale of data set that requires you to take special care.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app