How can I measure linear correlation of non-normally distributed variables? Pearson coefficient is not valid for non-normally distributed data, and Spearman's rho does not capture linear correlation.
Thank you
How can I measure linear correlation of non-normally distributed variables? Pearson coefficient is not valid for non-normally distributed data, and Spearman's rho does not capture linear correlation.
Thank you
Why do you require normality for computing a correlation? How about a simple scatterplot? As long as the data are continuous, ordinary (Pearson) correlation should be fine. All that it is measuring is the strength of the linear relationship between two variables (if indeed there is such a relationship).
In addition to Anscombe's quartet as mentioned by Peter Flom, here is a very nice paper in the risk-management context illustrating the problems of using linear correlation with non-normally distributed variables. In a nutshell, much of our intuition about how correlation behaves -- all values of $\rho \in [-1, 1]$ are possible; an exact monotonic relationship implies $|\rho | = 1$; $\rho = 0$ implies independence; etc, doesn't necessarily apply in the case of non-normality.
What @galit said is absolutely right. You can find the linear correlation between any two continuously distributed variables.
But perhaps you are thinking of the meaning of such a correlation? Indeed, Anscombe's quartet shows that while the correlation is defined for any pair of continuous variables, and its mathematical and statistical meaning is the same, its substantive meaningfulness may vary.