My confusion stems from reading the following paper
http://www.bmj.com/content/351/bmj.h3868
It states in its abstract (and they later show an empirical study that conforms to the claim) - "Overfitted models tend to underestimate the probability of an event in low risk patients and overestimate it in high risk patients"
I'm confused with regards to the generality of this statement. I've seen many cartoons/figures depicting overfitting as a model modelling/capturing noise, and it isn't intuitively obvious to me how that noise would necessarily lead to overestimates of risk in high risk and underestimates of risk in low risk patients. Why can't an overfitted model capture noise in a way that it underestimates the risk in high risk patients? Is there a mathematic proof for their statement?
For the second part of my question, I would like to ask, if the claim they make is true, would calibration plots of underfitted models go the other way (slope<1 for observed v/s predicted; underestimating high risk risks, and overestimating low risk risks?). Again I can't intuitively predict why a simpler underfitting model would necessarily generate a predictable calibration curve.