5

I have a question about confidence interval calculations for the odds ratio $\hat{p}\over{1-\hat{p}}$ from a logistic regression model (perhaps obtained from the method of Generalized Estimating Equations, but that's of secondary importance). It's best asked after a bit of background and an example:

Let's assume we have a simple logistic regression model with a single binary independent variable ($X={0,1} $ ). For $X=1$, we can estimate the log odds by:

$\log({\hat{p}\over{1-\hat{p}}})=\hat{\beta}_0+\hat{\beta}_1$

To obtain a 95% confidence interval (CI) for the odds ratio, textbooks that I've consulted indicate that you calculate the 95% CI for the odds-ratio, ${\hat{p}\over{1-\hat{p}}}$, by:

Lower Bound= $e^{[\hat{\beta}_0+\hat{\beta}_1-1.96Var(\hat{\beta}_0+\hat{\beta}_1)]}$

Upper Bound= $e^{[\hat{\beta}_0+\hat{\beta}_1+1.96Var(\hat{\beta}_0+\hat{\beta}_1)]}$

What I am wondering is if it is appropriate to use the the Delta Method to obtain the variance of the odds ratio $\hat{p}\over{1-\hat{p}}$ $= \exp{(\hat{\beta}_0+\hat{\beta}_1)}$ instead? For example, could obtain a 95% confidence interval using the following method?:

(1) First, exponentiate to obtain the point estimate of the odds ratio: $\hat{p}\over{1-\hat{p}}$$=\exp(\hat{\beta}_0+\hat{\beta}_1)$=$f(\hat{\beta}_0,\hat{\beta}_1)$.

(2) Find the approximte large sample variance of this non-linear point estimate by the Delta methods by finding the gradient of $f$ as: $\nabla f =[\exp(\hat{\beta}_0+\hat{\beta}_1), \exp(\hat{\beta}_0+\hat{\beta}_1)]$

(3) Calculate $Var({\hat{p}\over{1-\hat{p}}})$ $\approx$ $\nabla f \hat{\Sigma} \nabla f^T$. where $\hat{\Sigma}$ is the empirical covariance matrix of $\hat{\beta}$.

(4) Find the approximate standard error, $SE$ by taking the square root of the approximate variance estimate $SE = \sqrt{\nabla f \Sigma \nabla f^T}$

(5) Finally, obtain the 95% CI by $\exp(\hat{\beta}_0+\hat{\beta}_1)\pm Z_{1-.05/2}SE$.

Is the delta method I have described appropriate to find a 95% confidence interval, and if it is, why is it never described in textbooks on logistic regression? If it isn't, why not, when it seems you can use it to linearize it for similar problems.

UPDATE: I updated the step 3 to make it clear that I'm using the estimated, empirical covariance matrix.

UPDATE 2

I still haven't received a very satisfactory answer to this question. It has even made me wonder why people don't use the Delta Method to compute confidence intervals around estimates of just $\hat{p}$ now, instead of talking the inverse of the logit link of the upper and lower confidence intervals. Maybe @FrankHarrel can shed some light as he seems especially knowledgeable about these types of calculations?

StatsStudent
  • 10,205
  • 4
  • 37
  • 68
  • if you use R , the function 'glm' with family is binary, then if your estimated model is m –  Apr 05 '16 at 07:45
  • I think you will find that Stata uses the delta method to report standard errors for odds ratios from logistic regression but estimates the confidence intervals on the log scale. This often confuses people as they do not agree. – mdewey Apr 05 '16 at 10:28
  • @fcop and mdewey, can you comment on whether or not the steps I've listed above seem like a valid approach to modelling the confidence intervals? – StatsStudent Apr 05 '16 at 16:09
  • @Analyst1: can you explain where you take the covariance between $\hat{\beta}_i$ into account and how you estimate it? –  Apr 05 '16 at 17:43
  • @fcop, the covariance between the $\hat{\beta}_i$ are taken into account in step 3 (and 4), with $\hat{\Sigma}$. $\hat{\Sigma}$ is the empirical covariance matrix of the $\hat{\beta}_i$'s. Is this what you are asking? – StatsStudent Apr 05 '16 at 17:58
  • What do you mean by 'empirical covariance matrix of the $\hat{\beta}_i$', how do you compute that? –  Apr 05 '16 at 18:00
  • Sorry I didn't make that clear earlier. I'm actually fitting the logistic model using the method of Generalized Estimating Equations which makes use of Huber-White Sandwhich estimators. With this approach, you can use model-based or empirical-based estimates of the covariance matrix, but that's really of secondary importance. The import thing that's relevant to this post, i think, is that $\hat{\Sigma}$ is the covariance matrix of the $\hat{\beta}_i$'s. – StatsStudent Apr 05 '16 at 18:02
  • For those curious about the Huber-White Sandwich estimators, can check out this nice write-up: http://www.stat.berkeley.edu/~census/mlesan.pdf – StatsStudent Apr 05 '16 at 18:04
  • Why don't you use the fact that $\hat{\beta}_0+\hat{\beta}_1 \sim N(\beta_0+\beta_1,\sigma_1^2+\sigma_0^2+2\sigma_{01}$? –  Apr 06 '16 at 05:48
  • Would this just amount to the same thing as the Delta method if I used this? – StatsStudent Apr 06 '16 at 06:38
  • I think it will be different –  Apr 06 '16 at 17:53
  • @StatsStudent have a look at my answer [here](https://stats.stackexchange.com/questions/29044/plotting-confidence-intervals-for-the-predicted-probabilities-from-a-logistic-re/354854#354854) to see if you find it useful - I can post it on this thread instead – Xavier Bourret Sicotte Jul 09 '18 at 06:57

1 Answers1

1

I do not think the method you outline will be widely accepted because (a) the odds ratio is not symmetric about its null value of unity whereas the log odds ratio is symmetric about its null value of zero and your method gives intervals symmetric on the odds ratio scale (b) your method is capable of giving an interval which includes impossible values (negative) for the lower limit.

mdewey
  • 16,541
  • 22
  • 30
  • 57
  • Thank you. What if I were comparing the difference of two proportions? estimated from a logistic regression model? Would that method seem more plausible with regard to (b)? In other words, could I carry out essentially the same process, only with a modified estimating function such as $\hat{p}_1\over{1-\hat{p}_1}$ - $\hat{p}_2\over{1-\hat{p}_2} $ and modified gradient? – StatsStudent Apr 05 '16 at 16:38
  • I should have said, the difference of two odds ratios in my comment above or the difference of two proportions (and provided the estimating function $\hat{p}_1-\hat{p}_2$. Sorry for the confusion. – StatsStudent Apr 05 '16 at 17:00
  • You are still facing the same problems if you work on the absolute scale. Do you think the difference between an OR of 2 and 3 is the same as between 10 and 11? The usual route is to work in terms of relative odds ratios by fitting a logistic regression and then exponentiating the estimate and the limits of the confidence interval. – mdewey Apr 05 '16 at 20:55
  • I understand that the concerns with the absolute scale and they are duly noted. But I'm more concerned with interpretability by laymen than the difficulties encountered using the absolute scale. – StatsStudent Apr 05 '16 at 22:25