Comparison between Nested and Non-nested Model: BIC and LRT

Question

I would like to select the best model for predicting breast cancer risk, specifically, it is the comparisons between weight/BMI/height, as other covariates remain the same in all the models. But I got opposite results for nested model selection by using BIC and LRT. Say that the P-value of LRT is <0.05, but the BIC of the richer model is larger than the less rich model, and the delta BIC is larger than 2, which suggests positive evidence.

Q1: Should I use BIC for nested model comparison? Can BIC penalize the overfitting issue adequately for the richer models?

Q2: Can I use BIC for non-nested model selection and LRT for nested model selection? For example, using BIC to select the best model among weight-only/BMI-only/height-only models; using LRT to select the best model between weight+BMI model Vs BMI-only model to see if weight added additional significant information to the BMI-only model.

I know there are many posts about model selections using BIC/AIC/LRT. But none of them really solve my question.

See the answer here: https://stats.stackexchange.com/questions/198799/aic-versus-likelihood-ratio-test-in-model-variable-selection. It concerns AIC rather than BIC but the answer is the same. — Spätzle, Aug 12 '21 at 04:15
I've read that post too, and I raised a question there. I did not get the answer to my question... — Zhoufeng, Aug 12 '21 at 04:18
It still depens on your purpose: If you're just looking to find a better explaining model, then surely you should go by the BIC. — Spätzle, Aug 12 '21 at 04:21
I'm looking for a better predictor, BMI/weight. In all the fitted models, the weight-only model had the smallest BIC. what is confusing to me is that the BIC of the BMI-only model is 3-unit smaller than the BMI+weight model, which means BMI is a better predictor than BMI and weight combined. However, the LRT for BMI+weight Versus BMI is significant, which means weight added additional information to the BMI model. How would you explain these results? — Zhoufeng, Aug 12 '21 at 04:45

score 1 · Answer 1 · answered Aug 12 '21 at 04:58

by definition, BMI and weight are related ($BMI=w/h^2$). Taking them together does not add a significant amount of information to your model, hence the unimpressive BIC difference. In large sample sizes, you would almost always find the additional variable significant - that's a key problem of the traditional significance testing and specifically looking only at the p-value.

Recall that p-value is not "the probability that $H_0$ is wrong" but rather "the probability for rejecting $H_0$ in favor of a powerful alternative $H_1$", and then think of how to compute the power of $H_1$.

Nah, I'm kidding. You don't really need to compute it. Just keep in mind that a p-value does not say anything by itself and that with great sample sizes comes great responsibility for being cautios in interpreting the results.

Thanks for your answer. If the weight model is the best based on BIC. And the BIC of weight+BMI model is smaller than the BMI model. Would that make sense to compare the goodness of fit of the two models based on LRT? The result is that the p value is less than 0.05 which could mean weight added additional information to BMI model, If we assume the importance of LRT.. — Zhoufeng, Aug 12 '21 at 13:12

Comparison between Nested and Non-nested Model: BIC and LRT

1 Answers1

Linked