I have a skew variable and I want to use box-cox transformation (log) but around 50 observation out of 200 are between 0 and 1.
My initial thought was to add a constant to all observations before taking the log. Is it a good idea?
If I do add this constant, how can I determine the back transformation function?