Assume I have 1000 responses from some count data, where each response follows the Poisson distribution with a mean (and variance) falling somewhere in the large range of 1 - 100. There is one explanatory variable that is roughly the mean of each response. Should this be the perfect case of using Poisson regression?
When I follow the usual diagnostic tests found here, they all point to the fact that the data is overdispersed. I assumed overdispersion only occurs when each response variable has a variance greater than its mean. Is this incorrect? As the response variables are taken from the Poisson distribution shouldn't this mean that the data is not overdispersed at all?
Appreciate I could be way off with my understanding here. Any help is greatly appreciated.
The R code I used to simulate the dataset is given below:
dist_length <- 1000
counts <- rep(0, dist_length)
lambdas <- rep(0, dist_length)
for (i in 1:dist_length)
{
lambda <- runif(1, 1, 100)
lambdas[i] <- lambda + runif(1, -0.1, 0.1)
counts[i] <- rpois(1, lambda)
}
poiss_dist <- data.frame('Count'=counts, 'Mean'=lambdas)