My goal is to predict taxi demand in NYC depending on location and time. I have a dataset with ~18 million observations. With that said, I could add a large number of predictors.
But when would I run into the curse of dimensionality. E.g. adding dummies for all tracts in NYC, would result in 2165 (n-1) additional predictors.