I am doing clustering on a given data. When I plot the distributions of the individual features of this data, I found there are many columns that shows "long tail distribution".
I am wondering should I apply log transformation to these columns ?
If yes, what is the reason to do this?
I tried to conduct feature scaling only, which I would use $\frac{X−X.mean()}{X.std()}$. This should transform the values to be around 0. But I worry after the feature scaling the distribution is still so skewed that, it may be not useful to the clustering task? So I'm wondering if I should apply the log transformation before the feature scaling (standardization).