How to find out the regions a classifier performs well in machine learning?

Question

I need to find out the regions (could be more than 1) where a classifier performs well on training data. I need to somehow store those regions and for two or more classifiers find out the overlaps of regions of different classifiers. How can I implement this in R or any other libraries? The dataset will definitely be multidimensional.

What classifier do you use? It certainly depends on this, for example decision trees have this capability out of the box. — Jakub Bartczuk, Sep 05 '17 at 08:18
My problem is for general case. The classifier can be of any type. It's a vital part of my thesis. Thank you for your comment. — khorgosh, Sep 07 '17 at 15:36

score 1 · Answer 1 · answered Feb 08 '18 at 07:47

Take a large amount of testing data, enough to cover a meaningful amount of your IV space.
Apply your classifier to the testing data. Evaluate it, using your favorite error measure.
Use a regression method to learn how the error of your classifier depends on the input IVs.

Essentially, you would be using a two-step ML procedure, where the second step would be to learn how well the the first step performs.

I recommend that you don't use training data, but test data here. Otherwise you will be very likely to overfit.

How to find out the regions a classifier performs well in machine learning?

1 Answers1