Interesting question on confidence intervals

Question

Say I conducted a sample and did the relevant processing to get a 95% confidence interval of a parameter to fall between a [10,14].

My friend did another sample, and got a x% confidence interval of a parameter to fall between [11,13].

Is x = 95, <95 or >95?

Edit: Made a mistake here. The friend was supposed to use the same data in the qns so answer should be x< 95% :D

Re the edit: the answer is *still* not determined: your friend might have used a more powerful procedure than you did. — whuber, Oct 23 '20 at 14:23
@Sextus That seems unlikely when that procedure shrank the interval so much, but it is possible in principle. — whuber, Oct 23 '20 at 15:30
@whuber so you are suggesting that [10,14] wasn't truly a 95% confidence interval after all (creating boundaries more wide than necessary)? — Sextus Empiricus, Oct 23 '20 at 15:38
@Sextus Not at all: it's perfectly possible for a statistical problem to have many admissible confidence interval procedures as well as many more inadmissible ones. The sense of "creating boundaries more wide than necessary" suggests there exists a unique admissible procedure, but that's rarely the case. A standard example is an iid sample from a lognormal distribution with unknown geometric mean and sd. — whuber, Oct 23 '20 at 15:41
@whuber I am not sure how you argue that the method which leads to [11,13] indicates more power than the method that leads to the [10,14] interval. If the one method gives larger intervals for *all* samples then the intervals do not have the same confidence level. If the one method gives larger intervals only for some samples (and it will compensate with smaller intervals for other samples) then we can not know which one is more powerful than the other just because the interval of one method is smaller than the other in one case (since for a different sample it will be an opposite situation). — Sextus Empiricus, Oct 23 '20 at 16:09
@Sextus Are you perhaps implicitly assuming both methods are using the same model for the data?? — whuber, Oct 23 '20 at 16:12
Yes, I was assuming both methods are using the same model for the distribution/probability of the data. — Sextus Empiricus, Oct 23 '20 at 16:20

score 6 · Answer 1 · answered Oct 22 '20 at 21:28

6

The answer is underdetermined.

The latter interval is narrower, but that could be explained by smaller sample variance (say by chance) or by the use of a different quantile (i.e. different $\alpha$ and thus different confidence).

answered Oct 22 '20 at 21:28

Demetri Pananos

24,380
1
36
94

Sextus Empiricus · Accepted Answer · 2020-10-24T12:03:53.600

Demetri Pananos argued that the question is underdetermined based on the idea that the sample might be different. However, even when the same sample is used (as clarified on your edit) then we can still not know whether the confidence interval [11,13] is a lower or higher % confidence interval than the confidence interval [10,14].

Confidence intervals are not defined uniquely, (and similar % intervals may provide different sizes of intervals)

For a particular observation one method may result in a shorter confidence interval than another method, but that says nothing about the overall confidence level of that confidence interval (the method of the friend might result in larger intervals for other samples, we only know that for this particular sample the one interval is shorter than the other). Related question: What is a rigorous, mathematical way to obtain the shortest confidence interval given a confidence level?
Confidence intervals are not conditional on the sample. The interval [10,14] might have more probability to contain the parameter than the interval [11,13]. However, the confidence is not the probability that the interval contains the parameter. Related question: Why does a 95% Confidence Interval (CI) not imply a 95% chance of containing the mean?

Example

Below is an example that is a bit far-fetched, and this answer is pedantic. But it does illustrate how confidence intervals should be interpreted.

Say we compute the mean of a sample from a normal distributed population that is parametized by the 4-th root $\theta^{1/4}$ of $\theta$. Let the sample distribution of the mean be:

$$\bar{x} \sim \mathcal{N(\mu_{\bar{x}} = \text{sign}(\theta)\text{abs}(\theta)^{1/4}\, ,\, \sigma_{\bar{x}} = 1})$$

this expression $\text{sign}(\theta)\text{abs}(\theta)^{1/4}$ is like $\theta^{1/4}$ but also has a solution for negative $\theta$.

The image below shows how we can create different 95% confidence intervals for this case (see for more about this: The basic logic of constructing a confidence interval). We sketch two cases.

In one case, the solid lines, boundaries are chosen by the upper and lower 2.5% test intervals.
In the other case, the broken line, the boundaries are chosen by the upper 5% for $\theta<0$ and the lower 5% for $\theta>0$

In both these cases, the confidence intervals will contain the true value 5% of the time, independent of the true parameter $\theta$.

If we would observe $\bar{X} = 0$, then we would have two different intervals approximately $[-7.320,7.320]$ and $[-14.757,14.757]$. That's a difference by a factor two, just like your case.

I agree that this answer is pedantic, and shows how the question is technically underdetermined. But in practice, you will not find the situation of the example above. In a 'normal' situation we the confidence intervals will not be constructed in these strange/extreme ways and when one confidence interval is smaller (for the same data) then it typically relates to a smaller % confidence. But, as this answer shows, it is not necessary.

This is truly educational. Thank you. I don't think any of my Profs can make a lesson this interesting. — Joel Tan, Nov 05 '20 at 13:04
@JoelTan to be honest, I find the intuition that [11,13] represents a confidence interval with a smaller x% than [10,14] not so bad. In practice, this difference, a factor 2, is large and you would not encounter this unless the confidence level is different. So, my answer here is only to show the theoretical possibility that the confidence levels for these two different intervals can be the same. In practice, you will not get a situation with such a strong difference in intervals for the same confidence levels. *(and that is why my teachers often did not like me)* — Sextus Empiricus, Nov 05 '20 at 13:10

quester · Answer 3 · 2020-10-24T16:06:05.660

1

https://besjournals.onlinelibrary.wiley.com/doi/10.1111/1365-2656.12382

please read this article, here you will see differences in different calculation methods for confidence interval,

of course different samples yield different results in same method

in the end you can't point any answer, samples alone can distort heavily answers for different methods (+numerical errors)

edited Oct 24 '20 at 16:06

answered Oct 24 '20 at 11:08

quester

472
3
12

Interesting question on confidence intervals

3 Answers3

Example