I want to compute the distance matrix for the columns on a 1000 x 230 matrix using the dist()
function in R. Though, I am uncertain about which method to use.
I know the differences between the methods and how the algorithm works, but I would like to hear from you which one will you prefer when working on gene expression data sets. The values are z-scores derived from normalized gene expression data. That is, data are real-valued from -10 to 10 (roughly) and it is important if the value is negative or positive (means: if the gene is upregulated or downregulated).
Could be rephrased to make it more specific: Which of the methods dist() function supports could be the optimal to understand the differences and similarities for the columns of my matrix.