I was wondering if anyone could help me with information about kurtosis (i.e. is there any way to transform your data to reduce it?)
I have a questionnaire dataset with a large number of cases and variables. For a few of my variables, the data shows pretty high kurtosis values (i.e. a leptokurtic distribution) which is derived from the fact that many of the participants gave the exact same score for the variable. I do have a particularly large sample size, so according to the central limit theorem, violations of normality should still be fine.
The problem, however, is that fact that the particularly high levels of kurtosis are producing a number of univariate outliers in my dataset. As such, even if I transform the data, or remove/adjust the outliers, the high levels of kurtosis mean that the next most extreme scores automatically become outliers. I aim to use (discriminant function analysis). DFA is said to be robust to departures from normality provided that the violation is caused by skewness and not outliers. Furthermore, DFA is also said to be particularly influenced by outliers in the data (Tabachnick & Fidel).
Any ideas of how to get around this? (My initial thought was some way of controlling the kurtosis, but isn't it kind of a good thing if most of my sample are giving similar ratings?)