2

I'm dealing with this big dataset which has:

  • 1 categorical variable with 90 levels that represent some sort of "geographical area"

  • 3 continuous variables

What I'm trying to do is to "aggregate" the levels of the categorical variable, to a maximum of 10.

I don't know exactly which technique I should use to accomplish this, for example whether I should use factor analysis or some clustering (unsupervised)

kjetil b halvorsen
  • 63,378
  • 26
  • 142
  • 467
mariob6
  • 425
  • 2
  • 11

0 Answers0