Is there any consensus over whether one needs to drop first when performing one-hot encoding.
With reference to here and here, I am thinking that only when you are using OLS then you need to leave one column out due to singularity issue, and thus, if you use regularized, or non-linear models, then there is no need to do this. I just want to clarify if there is any dummy trap here?