I have a large dataset (n = 170,000), distributed roughly equally among 5 groups (each is 34,000 ± ~1,000). I want to compare whether the means in any of the first 4 groups are different from the control. Before the data came in, I figured the best approach would be to use Dunnett's test; however, now I'm unsure how best to handle this.
After doing some research, it seems there are at least 2 ways to go about it:
- Normalize the values (which still yields a skewed distribution)
- Use a non-parametric counterpart
I'd like to get an idea of the pros/cons of each, so I can decide on the best approach.