I have read through a large number of stack posts about hypothesis testing on Poisson distributions. Some examples:
- Calculating test statistic of a poisson distribution
- Hypothesis testing with Poisson Data
- hypothesis testing using poisson distribution
The above four posts ask this question in slightly different ways, yet the answers obtained are not recognizably similar. Thus I will ask the question here in hopes of obtaining an authoritative answer.
The question
I have a Poisson distribution with known mean $\mu$. What is the probability of obtaining a single sample from that distribution which is greater than some value $x$?
Relatedly, what is the probability of drawing $n$ samples from that distribution which have an average value greater than $\bar{x} = \sum_i^n{x_i} / n$.?
Since the variance of the Poisson distribution is equal to its mean $\mu$, we may compute a Z-score like:
$$ z = \frac{x-\mu}{\sqrt{\mu}} $$
I have not seen this expressed explicitly anywhere, but I assume the generalization to multiple samples is accomplish like:
$$ z = \frac{\bar{x}-\mu}{\sqrt{\mu / n}} $$
With a Z-score in hand, one can look up the probability using the Z distribution CDF. In python:
import scipy.stats
p = scipy.stats.norm.cdf(z)
Firstly, I am wondering if the above characterization of the approach is correct. This is simply the understanding I have gleaned from searching the topic.
Secondly, where does this approach come from? For example, in the case of an academic paper, whom would I cite about such an approach?
Finally, the normal approximation is sometimes but not always mentioned in posts on this topic. Does the above approach use the normal approximation? As such, is it inappropriate for $\mu$ with smaller values? If so, what would be an appropriate test?