
Exploratory Data Analysis (EDA) in Machine Learning - ML 075
Adventures in Machine Learning
Summary Stats - A Great Tool for Visualizations
With really large data sets, plotting a histogram for each variable is often pretty expensive. If you can find good summary stats like skew, kurtosis, or whatever your go-tos are, they can help you understand what's going on in your data effectively and efficiently. A QQ plot essentially maps the percentiles, or quantiles, of one distribution against those of another. The important thing is to know where to go to look up what those things actually are.
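As a rough illustration of the idea, here is a minimal sketch that computes skew and excess kurtosis and builds the quantile pairs a QQ plot would draw, using only the Python standard library. The function names (`skewness`, `excess_kurtosis`, `qq_points`), the simulated data, and the choice of a standard normal as the reference distribution are all assumptions made for this example, not something taken from the episode.

```python
import random
import statistics


def skewness(xs):
    """Third standardized moment (population form). Near 0 for symmetric data."""
    mean = statistics.fmean(xs)
    sd = statistics.pstdev(xs)
    return sum(((x - mean) / sd) ** 3 for x in xs) / len(xs)


def excess_kurtosis(xs):
    """Fourth standardized moment minus 3. Near 0 for normally distributed data."""
    mean = statistics.fmean(xs)
    sd = statistics.pstdev(xs)
    return sum(((x - mean) / sd) ** 4 for x in xs) / len(xs) - 3.0


def qq_points(xs, n_points=9):
    """Pair standard normal quantiles with quantiles of the standardized sample.

    Returns a list of (theoretical_quantile, sample_quantile) tuples.
    Points close to the line y = x suggest the sample is roughly normal.
    """
    mean = statistics.fmean(xs)
    sd = statistics.pstdev(xs)
    z = sorted((x - mean) / sd for x in xs)
    normal = statistics.NormalDist()  # reference distribution (assumed here)
    pairs = []
    for i in range(n_points):
        p = (i + 0.5) / n_points  # evenly spaced probabilities in (0, 1)
        sample_q = z[min(int(p * len(z)), len(z) - 1)]
        pairs.append((normal.inv_cdf(p), sample_q))
    return pairs


# Simulated columns standing in for real variables (hypothetical data).
random.seed(0)
symmetric = [random.gauss(0, 1) for _ in range(5000)]
right_skewed = [random.expovariate(1.0) for _ in range(5000)]

for name, column in [("symmetric", symmetric), ("right_skewed", right_skewed)]:
    print(name, round(skewness(column), 2), round(excess_kurtosis(column), 2))
    for theoretical, observed in qq_points(column):
        print(f"  {theoretical:+.2f} -> {observed:+.2f}")
```

Two numbers per column are much cheaper to scan than one histogram per column, and the QQ pairs show where a column departs from the reference distribution: for the skewed column, the observed quantiles drift away from the theoretical ones in the tails.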