To the best of my knowledge, the dependent variable (outcome) in a negative binomial regression should ideally represent count data. In R, if the interest is to model rates, I understand the use of +offset(log(population))
in glm.nb
or glmmadb
.
My question is related to conditions for other covariates. Are incidence/prevalence rates appropriate for additional independent variables? For example, if I'm trying to control for the incidence rate of diabetes per 100,000 within a group, could that be an acceptable independent variable in the negative binomial model?