How to compare the performance of different feature subsets with the same classifier?

Asked Aug 30 '16 at 08:10

Active Sep 06 '16 at 13:30

Viewed 58 times

I have a small dataset (55 samples) described by 20 features.

I performed a SVM (RBF) approach with cross-validation on 70% of the dataset (training part) and I recorded the AUC (average) for 150 combinations of features that may have a sense for the experiment (nevertheless I had tried before feature selection but with no success).

I have very good results (near 99% AUC) for some combinations of features and bad ones 51% for example for others.

My question is which statistical approach I have to use to assess correctly which combinations are better than others ?

edited Sep 06 '16 at 13:30

kjetil b halvorsen

63,378
26
142
467

asked Aug 30 '16 at 08:10

ltor

How to compare the performance of different feature subsets with the same classifier?

0 Answers0