Skip to content
Surf Wiki
Save to docs
general/correlation-indicators

From Surf Wiki (app.surf) — the open knowledge base

Coefficient of multiple correlation

Statistical concept


Statistical concept

In statistics, the coefficient of multiple correlation is a measure of how well a given variable can be predicted using a linear function of a set of other variables. It is the correlation between the variable's values and the best predictions that can be computed linearly from the predictive variables.

The coefficient of multiple correlation takes values between 0 and 1. Higher values indicate higher predictability of the dependent variable from the independent variables, with a value of 1 indicating that the predictions are exactly correct and a value of 0 indicating that no linear combination of the independent variables is a better predictor than is the fixed mean of the dependent variable.

The coefficient of multiple correlation is known as the square root of the coefficient of determination, but under the particular assumptions that an intercept is included and that the best possible linear predictors are used, whereas the coefficient of determination is defined for more general cases, including those of nonlinear prediction and those in which the predicted values have not been derived from a model-fitting procedure.

Definition

The coefficient of multiple correlation, denoted R, is a scalar that is defined as the Pearson correlation coefficient between the predicted and the actual values of the dependent variable in a linear regression model that includes an intercept.

Computation

The square of the coefficient of multiple correlation can be computed using the vector \mathbf{c} = {(r_{x_1 y}, r_{x_2 y},\dots,r_{x_N y})}^\top of correlations r_{x_n y} between the predictor variables x_n (independent variables) and the target variable y (dependent variable), and the correlation matrix R_{xx} of correlations between predictor variables. It is given by

where \mathbf{c}^\top is the transpose of \mathbf{c}, and R_{xx}^{-1} is the inverse of the matrix ::R_{xx} = \left(\begin{array}{cccc} r_{x_1 x_1} & r_{x_1 x_2} & \dots & r_{x_1 x_N} \\ r_{x_2 x_1} & \ddots & & \vdots \\ \vdots & & \ddots & \\ r_{x_N x_1} & \dots & & r_{x_N x_N} \end{array}\right). If all the predictor variables are uncorrelated, the matrix R_{xx} is the identity matrix and R^2 simply equals \mathbf{c}^\top\, \mathbf{c}, the sum of the squared correlations with the dependent variable. If the predictor variables are correlated among themselves, the inverse of the correlation matrix R_{xx} accounts for this. The squared coefficient of multiple correlation can also be computed as the fraction of variance of the dependent variable that is explained by the independent variables, which in turn is 1 minus the unexplained fraction. The unexplained fraction can be computed as the sum of squares of residuals—that is, the sum of the squares of the prediction errors—divided by the sum of squares of deviations of the values of the dependent variable from its expected value. ## Properties With more than two variables being related to each other, the value of the coefficient of multiple correlation depends on the choice of dependent variable: a regression of y on x and z will in general have a different R than will a regression of z on x and y. For example, suppose that in a particular sample the variable z is uncorrelated with both x and y, while x and y are linearly related to each other. Then a regression of z on y and x will yield an R of zero, while a regression of y on x and z will yield a strictly positive R. This follows since the correlation of y with its best predictor based on x and z is in all cases at least as large as the correlation of y with its best predictor based on x alone, and in this case with z providing no explanatory power it will be exactly as large. ## References ## References 1. [http://onlinestatbook.com/2/regression/multiple_regression.html Introduction to Multiple Regression] 2. [http://mtweb.mtsu.edu/stats/regression/level3/multicorrel/multicorrcoef.htm Multiple correlation coefficient] ::callout[type=info title="Wikipedia Source"] This article was imported from [Wikipedia](https://en.wikipedia.org/wiki/Coefficient_of_multiple_correlation) and is available under the [Creative Commons Attribution-ShareAlike 4.0 License](https://creativecommons.org/licenses/by-sa/4.0/). Content has been adapted to SurfDoc format. Original contributors can be found on the [article history page](https://en.wikipedia.org/wiki/Coefficient_of_multiple_correlation?action=history). ::
Want to explore this topic further?

Ask Mako anything about Coefficient of multiple correlation — get instant answers, deeper analysis, and related topics.

Research with Mako

Free with your Surf account

Content sourced from Wikipedia, available under CC BY-SA 4.0.

This content may have been generated or modified by AI. CloudSurf Software LLC is not responsible for the accuracy, completeness, or reliability of AI-generated content. Always verify important information from primary sources.

Report