From Surf Wiki (app.surf) — the open knowledge base
Fraction of variance unexplained
Statistical noise
In statistics, the fraction of variance unexplained (FVU) in the context of a regression task is the fraction of variance of the regressand (dependent variable) Y which cannot be explained, i.e., which is not correctly predicted, by the explanatory variables X.
Formal definition
Suppose we are given a regression function f that yields, for each y_i, an estimate \widehat{y}_i = f(x_i), where x_i is the vector of the i-th observations on all the explanatory variables. We define the fraction of variance unexplained (FVU) as:
:\begin{align} \text{FVU} & = {\text{VAR}_\text{err} \over \text{VAR}_\text{tot}} = {\text{SS}_\text{err}/N \over \text{SS}_\text{tot}/N} = {\text{SS}_\text{err} \over \text{SS}_\text{tot}} \left( = 1-{\text{SS}_\text{reg} \over \text{SS}_\text{tot}} , \text{ only true in some cases such as linear regression}\right) \\[6pt] & = 1 - R^2 \end{align}
where R^2 is the coefficient of determination, and VAR_err and VAR_tot are the variance of the residuals and the sample variance of the dependent variable, respectively. SS_err (the sum of squared prediction errors, equivalently the residual sum of squares), SS_tot (the total sum of squares), and SS_reg (the sum of squares of the regression, equivalently the explained sum of squares) are given by
:\begin{align} \text{SS}_\text{err} & = \sum_{i=1}^N\;(y_i - \widehat{y}_i)^2 \\ \text{SS}_\text{tot} & = \sum_{i=1}^N\;(y_i-\bar{y})^2 \\ \text{SS}_\text{reg} & = \sum_{i=1}^N\;(\widehat{y}_i-\bar{y})^2 \text{ and} \\ \bar{y} & = \frac 1 N \sum_{i=1}^N\;y_i. \end{align}
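The sums of squares above can be sketched numerically. The following is a minimal illustration, not a reference implementation: the data values and the helper names `sums_of_squares` and `fvu` are made up for this example, and the fit uses NumPy's `polyfit` as one convenient way to obtain an ordinary least-squares line with an intercept.

```python
import numpy as np


def sums_of_squares(y, y_hat):
    """Return (SS_err, SS_tot, SS_reg) for observations y and predictions y_hat."""
    y = np.asarray(y, dtype=float)
    y_hat = np.asarray(y_hat, dtype=float)
    y_bar = y.mean()
    ss_err = np.sum((y - y_hat) ** 2)    # residual sum of squares
    ss_tot = np.sum((y - y_bar) ** 2)    # total sum of squares
    ss_reg = np.sum((y_hat - y_bar) ** 2)  # explained sum of squares
    return ss_err, ss_tot, ss_reg


def fvu(y, y_hat):
    """Fraction of variance unexplained: SS_err / SS_tot."""
    ss_err, ss_tot, _ = sums_of_squares(y, y_hat)
    return ss_err / ss_tot


# Illustrative data (hypothetical values chosen for this sketch).
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.1, 3.9, 6.2, 7.8, 10.1])

# Ordinary least-squares line with intercept; polyfit returns the
# highest-degree coefficient first, so the order is (slope, intercept).
slope, intercept = np.polyfit(x, y, deg=1)
y_hat = slope * x + intercept

ss_err, ss_tot, ss_reg = sums_of_squares(y, y_hat)
fvu_value = fvu(y, y_hat)
r_squared = 1.0 - fvu_value

print(f"SS_err={ss_err:.4f}  SS_tot={ss_tot:.4f}  SS_reg={ss_reg:.4f}")
print(f"FVU={fvu_value:.5f}  R^2={r_squared:.5f}")
```

For this least-squares fit, SS_tot equals SS_err + SS_reg up to rounding, so FVU also equals 1 - SS_reg/SS_tot. For predictions that do not come from a least-squares fit with an intercept, that decomposition need not hold, which is why the parenthetical identity in the definition is restricted to such cases.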
Alternatively, the fraction of variance unexplained can be defined as follows:
: \text{FVU} = \frac{\operatorname{MSE}(f)}{\operatorname{var}[Y]}
where MSE(f) is the mean squared error of the regression function f.
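A small sketch can show that the two definitions agree when the variance of Y is computed with the same 1/N normalisation as the MSE. The function name `fvu_from_mse` and the numbers below are hypothetical and serve only as an illustration.

```python
import numpy as np


def fvu_from_mse(y, y_hat):
    """FVU computed as MSE(f) / var[Y], with var[Y] = SS_tot / N."""
    y = np.asarray(y, dtype=float)
    y_hat = np.asarray(y_hat, dtype=float)
    mse = np.mean((y - y_hat) ** 2)  # SS_err / N
    var_y = np.var(y)                # ddof=0 by default, i.e. SS_tot / N
    return mse / var_y


# Hypothetical observations and predictions.
y = np.array([3.0, 5.0, 7.0, 9.0])
y_hat = np.array([2.5, 5.5, 6.5, 9.5])

fvu_mse = fvu_from_mse(y, y_hat)
ratio_of_sums = np.sum((y - y_hat) ** 2) / np.sum((y - y.mean()) ** 2)

print(fvu_mse, ratio_of_sums)
```

Here SS_err = 1 and SS_tot = 20, so both expressions give 0.05. If the variance were instead computed with the N - 1 denominator (`ddof=1` in NumPy), the result would differ from SS_err/SS_tot by a factor of (N - 1)/N.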
Explanation
It is useful to consider the second definition to understand FVU. When trying to predict Y, the most naive regression function that we can think of is the constant function predicting the mean of Y, i.e., f(x_i)=\bar{y}. The MSE of this function equals the variance of Y; that is, SS_err = SS_tot and SS_reg = 0. In this case, no variation in Y can be accounted for, and the FVU equals 1, its maximum value for any model that predicts at least as well as the mean.
More generally, the FVU will be 1 if the explanatory variables X tell us nothing about Y in the sense that the predicted values of Y do not covary with Y. As prediction improves and the MSE decreases, the FVU goes down. In the case of perfect prediction, where \widehat{y}_i = y_i for all i, the MSE is 0, SS_err = 0, SS_reg = SS_tot, and the FVU is 0.
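The two limiting cases described above can be checked directly. This is a minimal sketch with made-up data; the helper `fvu` is a hypothetical name that simply implements SS_err / SS_tot.

```python
import numpy as np


def fvu(y, y_hat):
    """Fraction of variance unexplained: SS_err / SS_tot."""
    y = np.asarray(y, dtype=float)
    y_hat = np.asarray(y_hat, dtype=float)
    return np.sum((y - y_hat) ** 2) / np.sum((y - y.mean()) ** 2)


# Hypothetical observations with mean 4.
y = np.array([1.0, 4.0, 2.0, 7.0, 6.0])

# Naive predictor: always predict the mean of y, so SS_err = SS_tot.
fvu_mean = fvu(y, np.full_like(y, y.mean()))

# Perfect predictor: predictions equal the observations, so SS_err = 0.
fvu_perfect = fvu(y, y.copy())

print(fvu_mean)     # 1.0
print(fvu_perfect)  # 0.0
```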
References
- Achen, C. H. (1990). "What Does 'Explained Variance' Explain?: Reply". Political Analysis.
This article was imported from Wikipedia and is available under the Creative Commons Attribution-ShareAlike 4.0 License. Content has been adapted to SurfDoc format. Original contributors can be found on the article history page.