2

I need to find out the regions (could be more than 1) where a classifier performs well on training data. I need to somehow store those regions and for two or more classifiers find out the overlaps of regions of different classifiers. How can I implement this in R or any other libraries? The dataset will definitely be multidimensional.

khorgosh
  • 21
  • 1

1 Answers1

1
  1. Take a large amount of testing data, enough to cover a meaningful amount of your IV space.
  2. Apply your classifier to the testing data. Evaluate it, using your favorite error measure.
  3. Use a regression method to learn how the error of your classifier depends on the input IVs.

Essentially, you would be using a two-step ML procedure, where the second step would be to learn how well the the first step performs.

I recommend that you don't use training data, but test data here. Otherwise you will be very likely to overfit.

Stephan Kolassa
  • 95,027
  • 13
  • 197
  • 357