From Surf Wiki (app.surf) — the open knowledge base
Correlation coefficient
Numerical measure of a statistical relationship between variables
A correlation coefficient is a numerical measure of some type of correlation, meaning a statistical relationship between two variables. The variables may be two columns of a given data set of observations, often called a sample, or two components of a multivariate random variable with a known distribution.
Several types of correlation coefficient exist, each with its own definition, range of usability, and characteristics. They all assume values in the range from −1 to +1, where ±1 indicates the strongest possible correlation and 0 indicates no correlation. As tools of analysis, correlation coefficients present certain problems, including the propensity of some types to be distorted by outliers and the possibility of being incorrectly used to infer a causal relationship between the variables (for more, see Correlation does not imply causation).
Types
There are several different measures for the degree of correlation in data, depending on the kind of data: principally whether the data is a measurement, ordinal, or categorical.
Pearson
The Pearson product-moment correlation coefficient, also known as r, R, or Pearson's r, is a measure of the strength and direction of the linear relationship between two variables that is defined as the covariance of the variables divided by the product of their standard deviations. This is the best-known and most commonly used type of correlation coefficient. When the term "correlation coefficient" is used without further qualification, it usually refers to the Pearson product-moment correlation coefficient.
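The definition above (covariance divided by the product of the standard deviations) can be sketched in a few lines of Python. This is a minimal illustration, not a reference implementation; the function name `pearson_r` and the sample data are assumptions made for the example.

```python
import math


def pearson_r(x, y):
    """Pearson's r: covariance of x and y divided by the product of their standard deviations."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # The 1/n (or 1/(n-1)) factors cancel between numerator and denominator,
    # so plain sums of deviations are sufficient.
    co_dev = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sq_dev_x = sum((a - mean_x) ** 2 for a in x)
    sq_dev_y = sum((b - mean_y) ** 2 for b in y)
    if sq_dev_x == 0 or sq_dev_y == 0:
        raise ValueError("r is undefined when one variable is constant")
    return co_dev / math.sqrt(sq_dev_x * sq_dev_y)


# Hypothetical sample: y is an exact increasing linear function of x.
print(pearson_r([1, 2, 3, 4], [2, 4, 6, 8]))
```

An exactly linear increasing relationship gives a value of +1 (up to floating-point rounding), and an exactly linear decreasing one gives −1.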
Intra-class
Intraclass correlation (ICC) is a descriptive statistic that can be used when quantitative measurements are made on units that are organized into groups; it describes how strongly units in the same group resemble each other.
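Several ICC estimators exist, and the text above does not single one out. As an illustration only, the sketch below uses the one-way analysis-of-variance form for equally sized groups, ICC = (MSB − MSW) / (MSB + (k − 1)·MSW), where MSB and MSW are the between-group and within-group mean squares and k is the group size. The choice of estimator, the function name `icc_oneway`, and the sample data are assumptions made for the example.

```python
def icc_oneway(groups):
    """One-way ANOVA intraclass correlation for equally sized groups (assumed estimator)."""
    if not groups:
        raise ValueError("need at least 2 groups")
    n = len(groups)          # number of groups
    k = len(groups[0])       # measurements per group
    if n < 2 or k < 2 or any(len(g) != k for g in groups):
        raise ValueError("need at least 2 groups, all of the same size (at least 2)")
    grand_mean = sum(sum(g) for g in groups) / (n * k)
    group_means = [sum(g) / k for g in groups]
    # Between-group mean square: spread of the group means around the grand mean.
    ms_between = k * sum((m - grand_mean) ** 2 for m in group_means) / (n - 1)
    # Within-group mean square: spread of the values around their own group mean.
    ms_within = sum(
        (v - m) ** 2 for g, m in zip(groups, group_means) for v in g
    ) / (n * (k - 1))
    denominator = ms_between + (k - 1) * ms_within
    if denominator == 0:
        raise ValueError("ICC is undefined when all values are identical")
    return (ms_between - ms_within) / denominator


# Hypothetical data: members of each group are identical, groups differ.
print(icc_oneway([[1, 1], [2, 2], [3, 3]]))
```

When units within a group are identical and the groups differ, this estimator reaches its maximum of 1; when all variation is within groups, it becomes negative.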
Rank
Rank correlation is a measure of the relationship between the rankings of two variables, or two rankings of the same variable:
- Spearman's rank correlation coefficient is a measure of how well the relationship between two variables can be described by a monotonic function.
- The Kendall tau rank correlation coefficient is a measure of the portion of ranks that match between two data sets.
- Goodman and Kruskal's gamma is a measure of the strength of association of the cross tabulated data when both variables are measured at the ordinal level.
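Two of the rank measures listed above can be sketched as follows. Spearman's coefficient is computed here as Pearson's r applied to the ranks (with tied values given their average rank), and Kendall's tau is computed in its tau-a form, the difference between concordant and discordant pairs divided by the total number of pairs. The function names and the sample data are assumptions made for the example, and tau-a is only one of several Kendall variants.

```python
import math


def average_ranks(values):
    """Rank values from 1 to n; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for pos in range(i, j + 1):
            ranks[order[pos]] = mean_rank
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Spearman's rho as Pearson's r computed on the ranks of x and y."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mean_x, mean_y = sum(rx) / n, sum(ry) / n
    co_dev = sum((a - mean_x) * (b - mean_y) for a, b in zip(rx, ry))
    sq_x = sum((a - mean_x) ** 2 for a in rx)
    sq_y = sum((b - mean_y) ** 2 for b in ry)
    if sq_x == 0 or sq_y == 0:
        raise ValueError("rho is undefined when one variable is constant")
    return co_dev / math.sqrt(sq_x * sq_y)


def kendall_tau_a(x, y):
    """Kendall's tau-a: (concordant - discordant) / total number of pairs."""
    n = len(x)
    if n < 2 or len(y) != n:
        raise ValueError("x and y must have the same length (at least 2)")
    concordant = discordant = 0
    for i in range(n):
        for j in range(i + 1, n):
            sign = (x[i] - x[j]) * (y[i] - y[j])
            if sign > 0:
                concordant += 1
            elif sign < 0:
                discordant += 1
    return (concordant - discordant) / (n * (n - 1) / 2)


# Hypothetical sample: y = x**3 is monotonic in x but not linear.
xs = [1, 2, 3, 4, 5]
ys = [v ** 3 for v in xs]
print(spearman_rho(xs, ys), kendall_tau_a(xs, ys))
```

Because the relationship in the sample is perfectly monotonic, both functions return 1 even though the relationship is not linear.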
Tetrachoric and polychoric
The polychoric correlation coefficient measures association between two ordered-categorical variables. It is technically defined as the estimate of the Pearson correlation coefficient one would obtain if:
- The two variables were measured on a continuous scale, instead of as ordered-category variables.
- The two continuous variables followed a bivariate normal distribution.
When both variables are dichotomous instead of ordered-categorical, the polychoric correlation coefficient is called the tetrachoric correlation coefficient.
Interpreting correlation coefficient values
The strength and direction of the association between two variables are expressed by a coefficient such as r or R. Correlation values range from −1 to +1, where ±1 indicates the strongest possible correlation and 0 indicates no correlation between variables.
| Positive r or R | Negative r or R | Strength of association between variables |
|---|---|---|
| +1.0 to +0.8 | −1.0 to −0.8 | Perfect or very strong association |
| +0.8 to +0.6 | −0.8 to −0.6 | Strong association |
| +0.6 to +0.4 | −0.6 to −0.4 | Moderate association |
| +0.4 to +0.2 | −0.4 to −0.2 | Weak association |
| +0.2 to 0.0 | −0.2 to 0.0 | Very weak or no association |
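The table can be expressed as a small lookup on the absolute value of the coefficient. The bands in the table share their endpoints (0.8, for example, appears in two rows), so this sketch assumes that a boundary value belongs to the stronger band; that choice, and the function name `association_strength`, are assumptions made for the example.

```python
def association_strength(r):
    """Map a correlation coefficient to the descriptive label used in the table."""
    if not -1.0 <= r <= 1.0:
        raise ValueError("a correlation coefficient must lie between -1 and +1")
    magnitude = abs(r)  # the sign gives direction only, not strength
    # Assumption: boundary values (0.8, 0.6, 0.4, 0.2) fall in the stronger band.
    if magnitude >= 0.8:
        return "Perfect or very strong association"
    if magnitude >= 0.6:
        return "Strong association"
    if magnitude >= 0.4:
        return "Moderate association"
    if magnitude >= 0.2:
        return "Weak association"
    return "Very weak or no association"


print(association_strength(-0.65))  # falls in the 0.6 to 0.8 band
```

The sign of the coefficient affects only the direction of the association, so positive and negative values of the same magnitude receive the same label.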
This article was imported from Wikipedia and is available under the Creative Commons Attribution-ShareAlike 4.0 License. Content has been adapted to SurfDoc format. Original contributors can be found on the article history page.