I performed a clustering analysis on European countries using several continuous variables and the Ward linkage method. Ward produced the highest average silhouette width compared to single, complete, and average linkage. Although the global silhouette plot suggests an optimal k = 2, there is a meaningful local optimum at k = 7 that better reflects the structure I want to analyze.
However, using k = 7 results in one cluster containing only a single country and another cluster with only two countries. I calculated the median and IQR for each cluster, except for the cluster with a single observation.
My question is: can I still apply a non-parametric Kruskal–Wallis test to compare all seven clusters, and subsequently perform Dunn’s post-hoc test with Bonferroni correction, even though one cluster contains only a single observation and another contains only two observations?