May 25'23
Exercise
Determine which of the following statements about clustering is/are true.
I. Cutting a dendrogram at a lower height will not decrease the number of clusters.
II. K-means clustering requires plotting the data before determining the number of clusters.
III. For a given number of clusters, hierarchical clustering can sometimes yield less accurate results than K-means clustering.
(A) None
(B) I and II only
(C) I and III only
(D) II and III only
(E) The correct answer is not given by (A), (B), (C), or (D).
May 26'23
Key: C
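I is true. At the lowest height, each observation is its own cluster, and the number of clusters decreases as the cut height increases. Cutting at a lower height therefore either keeps the number of clusters the same or increases it; it never decreases it.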
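As a minimal sketch of statement I, assume single linkage on one-dimensional data. The example points and the helper `n_clusters_at_height` are hypothetical and not part of the exercise. Under that assumption, two neighbouring points stay in the same cluster when the gap between them is at most the cut height, so the cluster count is one plus the number of gaps larger than the cut height.

```python
def n_clusters_at_height(points, height):
    """Count single-linkage clusters when the dendrogram is cut at `height`.

    Assumes a non-empty list of 1-D points (an illustrative simplification).
    """
    xs = sorted(points)
    # Gaps between consecutive sorted points; in 1-D these are the only
    # distances that decide where single linkage separates clusters.
    gaps = [b - a for a, b in zip(xs, xs[1:])]
    # Every gap wider than the cut height separates two clusters.
    return 1 + sum(gap > height for gap in gaps)


# Hypothetical data, chosen only for illustration.
points = [1.0, 1.5, 2.0, 6.0, 6.4, 12.0]
heights = [0.0, 0.45, 0.5, 4.0, 6.0]  # increasing cut heights
counts = [n_clusters_at_height(points, h) for h in heights]

for h, c in zip(heights, counts):
    print(f"cut height {h:>4}: {c} cluster(s)")
```

Reading the printed counts from the last height back to the first shows that lowering the cut never reduces the number of clusters.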
II is false. K-means requires the number of clusters K to be specified in advance, but there is no need to plot the data to choose K or to perform the clustering.
III is true. K-means does a fresh analysis for each value of K, whereas in hierarchical clustering each reduction in the number of clusters merges clusters that have already been formed. Hierarchical clustering can therefore miss the best grouping when the clusters are not nested.
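The nesting constraint in statement III can be illustrated with a small sketch under stated assumptions. It uses six evenly spaced 1-D points (hypothetical data), measures quality by the total within-cluster sum of squares, and replaces an actual K-means run with an exhaustive search over contiguous splits, which is feasible only because the data set is tiny. The helpers `wss`, `split`, `total`, and `best_cuts` are illustrative names, not a library API.

```python
from itertools import combinations


def wss(cluster):
    """Within-cluster sum of squared deviations from the cluster mean."""
    mean = sum(cluster) / len(cluster)
    return sum((x - mean) ** 2 for x in cluster)


def split(points, cuts):
    """Split sorted points into contiguous clusters at the given cut indices."""
    bounds = [0, *cuts, len(points)]
    return [points[a:b] for a, b in zip(bounds, bounds[1:])]


def total(points, cuts):
    """Total within-cluster sum of squares for a set of cut indices."""
    return sum(wss(c) for c in split(points, cuts))


def best_cuts(points, k):
    """Exhaustively find the k-cluster contiguous split with the lowest total."""
    candidates = combinations(range(1, len(points)), k - 1)
    return min(candidates, key=lambda cuts: total(points, cuts))


points = [0, 1, 2, 3, 4, 5]  # hypothetical, already sorted

best3 = best_cuts(points, 3)  # fresh analysis for K = 3
best2 = best_cuts(points, 2)  # fresh analysis for K = 2

# A hierarchy that passes through the best 3-cluster grouping can only reach
# 2 clusters by merging two adjacent clusters, i.e. by dropping one cut.
nested2 = min(((c,) for c in best3), key=lambda cuts: total(points, cuts))

print("best 3 clusters:", split(points, best3))
print("best 2 clusters:", split(points, best2), total(points, best2))
print("nested 2 clusters:", split(points, nested2), total(points, nested2))
```

In this example the best three-cluster split cuts after the second and fourth points, while the best two-cluster split cuts in the middle, so no merge of the three clusters can reproduce it. The nested two-cluster result has a total of 5.5 against 4.0 for the fresh two-cluster analysis.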