I am no statistician, but I am among the more statistically inclined at my workplace, so my coworkers consult me from time to time. I don't alway give the right answer, but I'd like to think I give reasonable answer most of the time. This time, I couldn't find exactly what I was looking for on the internet, so I come here for help.
At our laboratory, they have received 23 samples and tested for a pathogen. 16 came out positive. In the general population, they would expect the prevalence to be about 10%, so this is rather high. They want to be sure, thoguh, and therefore want to design a control study. The main issue I was asked about was the control sample size. They didn't want a large control size, so I was asked to look into, for different control sizes and different control results, how strong would the result be.
So the first thing I found was the test statistic $$ z = \frac{\hat p_1 - \hat p_2}{\sqrt{\hat p(1-\hat p)\left(\frac1{n_1}+ \frac1{n_2}\right)}} $$ but it came with the stipulation that both trials have more than five successes and more than five failures. And if we expect a control prevalence of 10%, that means we should have at least 50 controls. That's more than the lab guys expect.
And I couldn't find any info on the internet to help me with the small sample size. So, this is what sounds reasonable to me:
Let the null hypothesis be that the control samples and the case samples come from the same Bernoulli distribution. Let the control sample size be $n$, and the control results be $k$ positive. Then we have $23+n$ samples from a Bernoulli trial, and the first $23$ of them had $16$ of the $16+k$ positive results.
The probability that at least $16$ of the positive results happen to appear among the first $23$ samples is $$ \sum_{j = 0}^{k}\frac{\displaystyle\binom{23}{16+j}\cdot \binom{n}{k-j}}{\displaystyle\binom{23+n}{16+k}} $$
To a non-statistician like me, that looks like a (one-tailed) $p$-value. And as the numbers are small, it's rather easy to compute.
I am pretty certain that I have missed something, because this is not my area of expertise. The fact that this is one-tailed is one of those things. Is this approach salvageable, or am I completely off the mark?