I am trying to build a linear regression model.
I have some high cardinal categorical features on which I want to apply target encoding. But my target (real-valued) variable distribution is highly right skewed, so I will apply some transform to get rid of skew.
Which of the following approach is sensible :
I should transform my target variable first and then apply target encoding on categorical feature based on transformed target.
I should apply target encoding on categorical feature based on original target. After that I should apply skew removal on my target variable.
Thanks in advance..