In addition to the question, please do correct anything below that I have incorrect.
I understand how the test statistics for a t-test and WRS/MWU test are calculated. In a t-test, $t=\frac{\bar x_{1}-\bar x_{2}}{SE}$, where the formula for $SE$ comes from a normal approximation, which is justified by the central limit theorem.
Even though the central limit theorem is used to justify generating a sampling distribution using a normal approximation, I understand from other posts on this site that the CLT isn't always guaranteed to generate a normal sampling distribution from non-normal date. It is my understanding that this is why normality is still a standard assumption for a t-test.
On the other hand, the calculation of $W$ or $U$ in a WRS/MWU is obviously a little more involved but does not involve any invocation of a distribution up to this point. Thus, why this test is considered distribution free.
However, once you try to calculate a p-value from a WRS/MWU test, I am confused as to how it remains different from a t-test in it's reliance on normality.
Just looking at wikipedia, the process for generating a p-value from $U$ simply involves generating a Z-score from the standard form $Z = \frac{U - m_{u}}{\sigma_{u}}$, where $m_{u} = \frac{n_{1}n_{2}}{2}$ and $\sigma_{u} = \sqrt{\frac{n_{1}n_{2}(n_{1} + n_{2}+ 1)}{12}}$, the latter two equations being derived from a normal approximation, when the sample sizes are "large enough." This sounds a lot like the CLT to me.
At this point, the steps for obtaining a p value seem the same between a t-test and a WRS/MWU test:
- The t-score in the t-test and z-score in WRS/MWU are both obtained using sampling distributions calculated from normal approximations
- These scores are then used on sampling distributions (t-distribution vs. normal distribution) to obtain the probability of an equal or more extreme score given the null hypothesis
Therefore, given that:
- Both these test utilize normal approximations to obtain p-values
- That the justification for both of these approximations seem to come from the CLT
- The reason that normality is an assumption for the t-test is because it is still possible for the CLT to be affected by non-normal data
Why is normality not also an assumption for p-values calculated from a WRS/MWU test?
Thanks for your help!