I use kmeans for clustering a set of data. However, I have to specify the number of clusters. The problem is that sometimes I need 2 and other times I need 3 clusters.
- Is there a clustering algorithm that could incorporate that feature in it?
I use kmeans for clustering a set of data. However, I have to specify the number of clusters. The problem is that sometimes I need 2 and other times I need 3 clusters.
this is a great paper to start with:
Estimating the number of clusters in a data set via the gap statistics
It's really easy to implement something similary in any language.
Simplest solution: do both and then check which gives best results...