Clustering
Clustering
Chapter 10 of
Introduction to Statistical Learning
By Gareth James, et al.
K Means Clustering
If you plot k against the SSE, you will see that the error
decreases as k gets larger; this is because when the
number of clusters increases, they should be smaller,
so distortion is also smaller.
The idea of the elbow method is to choose the k at
which the SSE decreases abruptly.
This produces an "elbow effect" in the graph, as you
can see in the following picture:
Choosing a K Value
Choosing a K Value
● They’ve been
recently
hacked and
need your help
finding out
about the
hackers!
Python and Spark