Spearman's Rank Correlation Statistic
(Redirected from Spearman Correlation)
Jump to navigation
Jump to search
A Spearman's Rank Correlation Statistic is a non-parametric rank correlation between the ranking of two variables.
- AKA: Spearman's rho, Spearman Correlation Coefficient, Spearman Rank Correlation Coefficient, Spearman R.
- Context:
- It can be defined as [math]\displaystyle{ \rho = 1 - \frac{6 \; \sum d_i^2}{n \; (n^2 - 1)} }[/math] where [math]\displaystyle{ d_i }[/math] is the difference between ranks of each observation on two random variables and [math]\displaystyle{ n }[/math] is the number of observations.
- …
- Counter-Example(s):
- See: Spearman Correlation Test, General Correlation Coefficient, Non-Parametric Statistics, Correlation and Dependence, Level of Measurement.
References
2017a
- (CM, 2017) ⇒ http://changingminds.org/explanations/research/analysis/spearman.htm
- The Spearman Rank Correlation Coefficient is a form of the Pearson coefficient with the data converted to rankings (ie. when variables are ordinal). It can be used when there is non-parametric data and hence Pearson cannot be used.
- The raw scores are converted to ranks and the differences ([math]\displaystyle{ d_i }[/math]) between the ranks of each observation on the two variables are calculated. The Spearman coefficient is denoted with the Greek letter rho ([math]\displaystyle{ \rho }[/math]).
- [math]\displaystyle{ \rho = 1 - (6 * SUM(d_i^2)) / (n * (n^2 - 1)) }[/math]
- (...) The Spearman Coefficient can be used to measure ordinal data (ie. in rank order), not interval (as Pearson). It effectively works by first ranking the data then applying Pearson's calculation to the rank numbers.
- This coefficient is also called Spearman's rho (after the Greek letter used).
- The raw scores are converted to ranks and the differences ([math]\displaystyle{ d_i }[/math]) between the ranks of each observation on the two variables are calculated. The Spearman coefficient is denoted with the Greek letter rho ([math]\displaystyle{ \rho }[/math]).
2017b
- (Quest Software Inc., 2017) ⇒ Statistics – Textbook, Nonparametric Statistics https://documents.software.dell.com/statistics/textbook/nonparametric-statistics#correlations
- Spearman R (Siegel & Castellan, 1988) assumes that the variables under consideration were measured on at least an ordinal (rank order) scale, that is, that the individual observations can be ranked into two ordered series. Spearman R can be thought of as the regular Pearson product moment correlation coefficient, that is, in terms of proportion of variability accounted for, except that Spearman R is computed from ranks.
2017c
- (Stat 509, 2017) ⇒ Design and Analysis of Clinical Trials, The Pennsylvania State University 18.2 - Spearman Correlation Coefficient https://onlinecourses.science.psu.edu/stat509/node/157
- The Spearman rank correlation coefficient, [math]\displaystyle{ r_s }[/math] , is a nonparametric measure of correlation based on data ranks. It is obtained by ranking the values of the two variables ([math]\displaystyle{ X }[/math] and [math]\displaystyle{ Y }[/math]) and calculating the Pearson [math]\displaystyle{ r_p }[/math] on the resulting ranks, not the data itself. Again, PROC CORR will do all of these actual calculations for you.
- The Spearman rank correlation coefficient has properties similar to those of the Pearson correlation coefficient, although the Spearman rank correlation coefficient quantifies the degree of linear association between the ranks of [math]\displaystyle{ X }[/math] and the ranks of [math]\displaystyle{ Y }[/math]. Also, [math]\displaystyle{ r_s }[/math] does not estimate a natural population parameter (unlike Pearson's [math]\displaystyle{ r_p }[/math] which estimates [math]\displaystyle{ \rho_p }[/math] ).
- An advantage of the Spearman rank correlation coefficient is that the [math]\displaystyle{ X }[/math] and [math]\displaystyle{ Y }[/math] values can be continuous or ordinal, and approximate normal distributions for [math]\displaystyle{ X }[/math] and [math]\displaystyle{ Y }[/math] are not required. Similar to the Pearson [math]\displaystyle{ r_p }[/math] , Fisher's Z transformation can be applied to the Spearman [math]\displaystyle{ r_s }[/math] to get a statistic, [math]\displaystyle{ z_s }[/math], that has an asymptotic normal distribution for calculating an asymptotic confidence interval.
2015
- (Wikipedia, 2015) ⇒ http://en.wikipedia.org/wiki/Spearman's_rank_correlation_coefficient Retrieved:2015-8-20.
- In statistics, Spearman's rank correlation coefficient or Spearman's rho, named after Charles Spearman and often denoted by the Greek letter [math]\displaystyle{ \rho }[/math] (rho) or as [math]\displaystyle{ r_s }[/math] , is a nonparametric measure of statistical dependence between two variables. It assesses how well the relationship between two variables can be described using a monotonic function. If there are no repeated data values, a perfect Spearman correlation of +1 or −1 occurs when each of the variables is a perfect monotone function of the other.
Spearman's coefficient, like any correlation calculation, is appropriate for both continuous and discrete variables, including ordinal variables. [1] Spearman's [math]\displaystyle{ \rho }[/math] and Kendall's [math]\displaystyle{ \tau }[/math] can be formulated as special cases of a more general correlation coefficient.
- In statistics, Spearman's rank correlation coefficient or Spearman's rho, named after Charles Spearman and often denoted by the Greek letter [math]\displaystyle{ \rho }[/math] (rho) or as [math]\displaystyle{ r_s }[/math] , is a nonparametric measure of statistical dependence between two variables. It assesses how well the relationship between two variables can be described using a monotonic function. If there are no repeated data values, a perfect Spearman correlation of +1 or −1 occurs when each of the variables is a perfect monotone function of the other.
2011
- (Sammut & Webb, 2011) ⇒ Claude Sammut, and Geoffrey I. Webb. (2011). “Rank Correlation.” In: (Sammut & Webb, 2011) p.828
- QUOTE: Spearman’s Rank correlation calculates the sum of squared rank distances and is normalized such that it evaluates to − 1 for reversed and to + 1 for identical rankings. Formally, it is defined as follows: [math]\displaystyle{ (τ,τ′)↦ 1 − \frac{6∑^m_{i=1}(τ(i)−τ′(i))}{2m(m^2−1)} \ (1) }[/math]