Required number for logistic regression

Question

I read that it is recommended that

for LR, a useful rule of thumb comes from simulation experiments which suggest that the number of the less common of the two possible outcomes (“events”) divided by the number of predictor variables should be at least 10, and preferably greater.

I'm not sure if I quite understand this. Does this mean say if the result was death, there were 7 predictor variables, and one of the predictor variables was playing baseball, which 4/100 people did, that 4/7 < 10, and therefor this criteria is not met? If this is not what is meant; is there any other condition for the number of events?

If you have say 100 subjects, 20 of whom die and 80 survive, then according to this rule of thumb you should have no more than 2 covariates (predictor variables) in your model. In fact, with 2 covariates the condition is satisfied: $\frac{\min(20,80)}{2} \ge 10$, while with 3 covariates it is not $\frac{\min(20,80)}{3}<10$ — boscovich, Dec 23 '14 at 21:55
Thank you for clearing that up! Is there a condition for number of events for a given predictor? — Cenoc, Dec 23 '14 at 23:10
In my experience you need more than 15 events per candidate predictor. And note that you need 96 subjects just to estimate the intercept within a margin of error of +/- 0.1 on the risk scale. — Frank Harrell, Dec 23 '14 at 23:13
@FrankHarrell, do you mean you need 96 instances of the lesser of the two events? Eg, intuitively it seems to me N=100, but p(Y=1)=.01, wouldn't likely yield a good estimate. — gung - Reinstate Monica, Dec 23 '14 at 23:40
$n=96$ total number of observations. Thanks for alert on SO. — Frank Harrell, Dec 24 '14 at 03:44
@FrankHarrell How hard-and-fast is the rule I described in the question? Is it generally flexible? Also, can this be formatted as an answer so I can check it? — Cenoc, Dec 24 '14 at 18:54
It all depends on the precision of model accuracy you demand. 15:1 works for me over a wide variety of situations. — Frank Harrell, Dec 24 '14 at 19:39

Required number for logistic regression

0 Answers0