For the pdf slides, click here
for normal linear regression
, also called coefficient of determination or multiple correlation coefficient, is defined for normal linear regression, as the proportion of variance “explained” by the regression model
Note that under the MLE, where , the deviance (i.e., negative two times log likelihood) is
- I list this derivation here to make clear that the following generalized contains as a special case for normal linear regression
Generalized by Cox and Snell
Generalized , proposed by Cox and Snell [1989] (and also Magee [1990] and Maddala [1983])
The genralized for more general models where
- the concept of residual variance cannot be easily define, and
- maximum likelihood is the criterion of fit, is
Here, and are the likelihood of the fitted and the null models, respectively.
For normal linear regression, this generalized becomes the classical
Desirable properties of the generalized , as in Eq
Consistent with classical
Consistent with maximum likelihood as an estimation method
Asymptotically independent of the sample size
has an interpretation as the propotion of unexplained “variation”
- For example, if we have three nested models, from smallest to largest, , and , then we have
- For more desirable properties (7 in total), please check out the Nagelkerke[1991] paper
Generalized by Nagelkerke
Generalized , proposed by Nagelkerke [1991]
An undesirable property: for discrete models, the maximum is always less than 1
- This is because the likelihood of discrete target variables are from pmf (rather than from pdf, as of continuous targets)
A new definition of the generalized
Majority of the desirable properties of , including the ones listed on the previous page, are still satisfied
Nagelkerke’s general seems to be a popular version. For example, the biostat textbook by Steyerberg uses this version
Generalized for binary data
Generalized for binary data
Denote the estimated binary probabilities as for the fitted model, and for the null model
Cox and Snell
Nagelkerke
References
Nagelkerke, N. J. D. (1991). A Note on a General Definition of the Coefficient of Determination. Biometrika, 78(3), 691-692.
A nice comparison of different versions of generalized : https://stats.idre.ucla.edu/other/mult-pkg/faq/general/faq-what-are-pseudo-r-squareds/
Steyerberg, E. W. (2019). Clinical prediction models. Springer International Publishing.