Imagine that we are conducting an experiment to test the effectiveness of a treatment, where the «level of illness» is measured by a count that is distributed as a negative binomial (NB). The plan is to use a mixed GLM for NB distributed counts.
First, it doesn't appear to make sense to actually treat the people who are not sick at all, so you would want to remove the $0$s from the initial distribution and see how the treatment affects sick people. I first thought that this was no big deal because you can shift the distribution by $-1$ and what you get still reasonably fits a NB distribution. The problem is that you would need to consistently shift the distribution after the treatment, which means that cured people would fall at $-1$, which doesn't make sense from a NB point of view.
So I can only see two options:
- Actually include non-sick people in the initial sample (which is not as bad as it sounds from a practical point of view as we are dealing with minor mental issues and non-invasive treatments so this would be technically possible).
- Keep the truncated initial sample, with no shift, and be able to argue that it's still ok to use a GLM on count data where the distribution has $0$s truncated off at $t=0$.
My question is: does option $2$ look reasonable or is it really that bad?
Thanks for any ideas or comments.