After running clustering, I'd like to explore the results and hopefully get some sort of concrete idea of where (i.e. which on which of my n
variables) differences arise between clusters.
What if I run n
one-way ANOVAs with cluster as my IV and each of n
variables as my DV? I am aware that this is both double dipping (the clustering separated the most distinct data points so there will definitely be differences) and would lead to inflated p values (from performing n
tests). Yet the only reason why I am doing this is as a post-hoc exploration, so I was wondering if it would be valid simply to give an indication of where those differences are?
If not then what alternative might there be to learn more about the clustering output?