Using scipy, I noticed that I am allowed to use only Euclidean distance for Ward's method.
Is it because Ward's uses Error Sum of Squared?
What if I use Ward's method with cosine similarity?
Cosine similarity seems still work so, but maybe not perfectly.
What might be the difference?