Questions tagged [spearman-rho]

Spearman's rank correlation coefficient, usually denoted as $\rho$, is a measure of concordance between two random variables.

Spearman's rho is a measure of concordance between two random variables. It is the linear (Pearson) correlation of the probability transformed random variables.

Let $r_{i,j}$ be the rank of $y_{i,j}$ among responses $\{ y_{1,j}, ..., y_{n,j}\}$ for $i = \{1, ..., n\}, j \in {1,2}$

$$\hat{\rho} = \text{Cor}[(r_{1,1},...,r_{n,1}),(r_{1,2},...r_{n,2})]$$

This measure is between -1 and +1 and depends only on the rank of the data values and so is invariant to monotonic transformations (unlike usual moment correlation).

It can also be expressed as a measure of concordance based on ranks, and has many properties in common with Kendall's tau.

For more information:


English psychologist Charles Spearman is best known for the Spearman rank correlation coefficient, or Spearman's rho. He also worked in factor analysis and is known for his work on models for human intelligence.

His best known work, the Spearman rank correlation coefficient, or Spearman's rho, is simply a Pearson correlation coefficient computed on the ranks of the data. It is a widely used measure of monotonic association between variables. It is unaffected by monotonic increasing transformation of the variables.

The Spearman correlation has good efficiency properties for measuring linear correlation at the normal and has much better robustness than the Pearson coefficient in the presence of outliers.

408 questions
145
votes
5 answers

How to choose between Pearson and Spearman correlation?

How do I know when to choose between Spearman's $\rho$ and Pearson's $r$? My variable includes satisfaction and the scores were interpreted using the sum of the scores. However, these scores could also be ranked.
user3636
143
votes
6 answers

Pearson's or Spearman's correlation with non-normal data

I get this question frequently enough in my statistics consulting work, that I thought I'd post it here. I have an answer, which is posted below, but I was keen to hear what others have to say. Question: If you have two variables that are not…
Jeromy Anglim
  • 42,044
  • 23
  • 146
  • 250
91
votes
5 answers

Kendall Tau or Spearman's rho?

In which cases should one prefer the one over the other? I found someone who claims an advantage for Kendall, for pedagogical reasons, are there other reasons?
Tal Galili
  • 19,935
  • 32
  • 133
  • 195
64
votes
5 answers

Is it meaningful to calculate Pearson or Spearman correlation between two Boolean vectors?

There are two Boolean vectors, which contain 0 and 1 only. If I calculate the Pearson or Spearman correlation, are they meaningful or reasonable?
Zhilong Jia
  • 785
  • 1
  • 6
  • 9
58
votes
5 answers

Correlations between continuous and categorical (nominal) variables

I would like to find the correlation between a continuous (dependent variable) and a categorical (nominal: gender, independent variable) variable. Continuous data is not normally distributed. Before, I had computed it using the Spearman's $\rho$.…
33
votes
3 answers

How do the Goodman-Kruskal gamma and the Kendall tau or Spearman rho correlations compare?

In my work, we are comparing predicted rankings versus true rankings for some sets of data. Up until recently, we've been using Kendall-Tau alone. A group working on a similar project suggested we try to use the Goodman-Kruskal Gamma instead, and…
Poik
  • 520
  • 1
  • 6
  • 10
28
votes
3 answers

If linear regression is related to Pearson's correlation, are there any regression techniques related to Kendall's and Spearman's correlations?

Maybe this question is naive, but: If linear regression is closely related to Pearson's correlation coefficient, are there any regression techniques closely related to Kendall's and Spearman's correlation coefficients?
sitems
  • 3,649
  • 1
  • 25
  • 52
24
votes
1 answer

What could cause big differences in correlation coefficient between Pearson's and Spearman's correlation for a given dataset?

The Pearson's coefficient between two variables is quite high (r=.65). But when I rank the variable values and run a Spearman's correlation, the cofficient value is much lower (r=.30). What is the interpretation of this?
user3671
  • 1,059
  • 2
  • 12
  • 20
23
votes
1 answer

Are random variables correlated if and only if their ranks are correlated?

Assume $X,Y$ are continuous random variables with finite second moments. The population version of Spearman's rank correlation coefficient $ρ_s$ can be defined as the Pearson's product-moment coefficient ρ of the probability integrals transforms…
FSpanhel
  • 231
  • 1
  • 3
23
votes
3 answers

Why is Pearson parametric and Spearman non-parametric

Apparently Pearson's correlation coefficient is parametric and Spearman's rho is non-parametric. I'm having trouble understanding this. As I understand it Pearson is computed as $$ r_{xy} = \frac{cov(X,Y)}{\sigma_x\sigma_y} $$ and Spearman is…
user2740
  • 1,226
  • 2
  • 12
  • 19
20
votes
5 answers

Which robust correlation methods are actually used?

I plan to do a simulation study where I compare the performance of several robust correlation techniques with different distributions (skewed, with outliers, etc.). With robust, I mean the ideal case of being robust against a) skewed distributions,…
Felix S
  • 4,432
  • 4
  • 26
  • 34
19
votes
4 answers

Prove the equivalence of the following two formulas for Spearman correlation

From wikipedia, Spearman's rank correlation is calculated by converting variables $X_i$ and $Y_i$ into ranked variables $x_i$ and $y_i$, and then calculating Pearson's correlation between the ranked variables: However, the article goes on to state…
Alex
  • 3,728
  • 3
  • 25
  • 46
19
votes
2 answers

How to calculate a confidence interval for Spearman's rank correlation?

Wikipedia has a Fisher transform of the Spearman rank correlation to an approximate z-score. Perhaps that z-score is the difference from null hypothesis (rank correlation 0)? This page has the following example: 4, 10, 3, 1, 9, 2, 6, 7, 8, 5 5, 8,…
dfrankow
  • 2,816
  • 6
  • 30
  • 39
18
votes
2 answers

How to correctly assess the correlation between ordinal and a continuous variable?

I'd like to estimate the correlation between: An ordinal variable: subjects are asked to rate their preference for 6 types of fruit on a 1-5 scale (ranging from very disgusting to very tasty) On average subjects use only 3 points of the scale. A…
San
  • 181
  • 1
  • 1
  • 3
16
votes
2 answers

Canonical correlation analysis with rank correlation

Canonical correlation analysis (CCA) aims to maximize the usual Pearson product-moment correlation (i.e. linear correlation coefficient) of the linear combinations of the two data sets. Now, consider the fact that this correlation coefficient only…
1
2 3
27 28