I am doing research on Winsorization (and trimming), which has been broadly applied in many fields, but I think many researchers didn't do it in a "rigorous" way. Or maybe even worse, they misuse it. So I am wondering if there is a well-defined, formalized way to apply Winsorization (or trimming). What appeared in many papers is that the researchers just apply Winsorization when there are some extreme values in their data set. They didn't
- Justify the mechanism of the extreme values (are they legitimate observations or from other contamination distributions).
- Follow the framework of robust statistics (make assumptions about the distribution, define the estimator, a.k.a. Winsorized Estimator, and do inference).
In my opinion, when people are talking about "Winsorization", there are two possible meanings:
- An action to change (Winsorize) the extreme values, but follows a classical statistical inference procedure.
- An estimator (Winsorized Mean estimator) which is defined as a functional on empirical cdfs: $\hat{\theta}=T(\hat{F}_n)$, and follows robust statistical procedure.
For the second, the data doesn't change; we just change the estimators. But for the first, the data is changed and is regarded as real observations. It is like data manipulation, which should be abandoned.
In this sense, any study follows the first procedure should be regarded as a misuse and should be taken with caution. Can I understand in this way?