Can someone explain me the problems associated with using more than 20 levels in a single dummy variables.
I am aware of the negative implications of using several dummies variables in a model. However recently I came across a reading that mentioned usage of several level with a single dummy category also has negative implications.i.e it can affect the estimates of other predictor coefficients.
When I experimentally tried it with a data set, I found that coefficients of other predictors are indeed distorted to a larger scale if I cut the number of levels from 20 to 15. ( Data set: I had 3 predictors and one dummy variable having 20 levels)
Can someone explain shortcomings of multilevel dummies.