Suppose I want to regress $X$ on $y$, controlling for categorical $z$ with $100$ different levels. I believe that linear regression is appropriate. Normally I would create dummies $D_i$ for each category of $z$ and run a least squares regression of the model
$$y = X\beta + \Sigma_{i=1}^{100}\delta_iD_i +\epsilon $$
However suppose $z$ is such that the first two categories $D_1$ and $D_2$ represent 49.5% of the data each and $D_{3:100}$ account for only 1%. So I merge everything but the first 2 categories together into one category, so that $z$ is now only 3 levels.
After doing this, can I still say I am controlling for $z$? It feels like since the individual coefficients for $D_{3:100}$ are no longer identifiable I'm losing something.