I have an n-by-m array, where every column sums to 1, in other words I have m probability vectors of size n. I would like to cluster them into several categories.
I will appreciate, if somebody points me to a robust method that can be used for this purpose.
The crucial point here is that these are probability vectors. So I am reluctant to use anything that is based on Euclidean distance.