1

RE: Model selection using misclassification rate in forward selection of logistic regression equation

A small misclassification error is good. Keep that factor in the model when doing logistic regression forward selection?

kjetil b halvorsen
  • 63,378
  • 26
  • 142
  • 467
  • 1
    That is probably a very bad idea. Logistic regression is not a classifyer, misclassification is not a proper score function. Search this site for "model selection logistic regression". There are already many good posts! – kjetil b halvorsen Dec 12 '16 at 00:51
  • Your reply discussed a score function. I looked at the other posts briefly. Exactly what do you mean by a score function? Can you give a definition? – Mary A. Marion Dec 13 '16 at 05:35
  • https://en.wikipedia.org/wiki/Scoring_rule – kjetil b halvorsen Dec 13 '16 at 13:05
  • I found: The score function, (), is the derivative of function (|) with respect to the parameters. Thus it has to do with the maximum likelihood estimators. Misclassification error for logistic regression is thus not the best way to evaluate logistic regression models. My question then becomes is it necessary to use maximum likelihood for model comparisons? – Mary A. Marion Dec 13 '16 at 14:16
  • The wiki article I referenced above is not about *that* score function, it is a more general concept! Please read it! – kjetil b halvorsen Dec 13 '16 at 14:31
  • http://stats.stackexchange.com/questions/95250/logistic-regression-maximum-likelihood-vs-misclassification/103893#103893 http://stats.stackexchange.com/questions/207274/probabilistic-classification-and-loss-functions/207286#207286 http://stats.stackexchange.com/questions/87137/relation-between-scoring-rule-and-loss-function-in-parameter-estimation-and-mode/116075#116075 http://stats.stackexchange.com/questions/145875/alternative-notions-to-that-of-proper-scoring-rules-and-using-scoring-rules-to/147426#147426 .... – kjetil b halvorsen Dec 13 '16 at 14:39
  • http://stats.stackexchange.com/questions/127042/why-isnt-logistic-regression-called-logistic-classification/127044#127044 http://stats.stackexchange.com/questions/47058/importance-of-variables-in-logistic-regression/47066#47066 http://stats.stackexchange.com/questions/215154/variable-selection-for-predictive-modeling-really-needed-in-2016 – kjetil b halvorsen Dec 13 '16 at 14:45
  • 1
    Done. I saw it earlier. I am now convinced. I ran 16 models using the misclassification error for model selection. I have been trying to be consistent in the use of statistics. The information about the error is still useful – Mary A. Marion Dec 14 '16 at 19:36

0 Answers0