I am working on an ML classification problem. The dataset has shape (15036, 216) and all of its variables are categorical. The task is to select the top 10 most relevant features using any two feature selection techniques, and then compare the two techniques against each other.
Searching online, I came across two methods that seem suitable:
- Chi-Squared Feature Selection
- Mutual Information Feature Selection for Classification
I applied both and identified the top 10 features from each. For chi-squared I ranked features by lowest p-value, and for mutual information by highest mutual information score.
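For reference, here is a minimal sketch of how such a ranking can be produced with scikit-learn. It is not my actual pipeline: the synthetic data, the ordinal encoding of the categories, and the names `X_raw`, `chi2_top`, and `mi_top` are assumptions made for illustration only. Note that `chi2` requires non-negative inputs, and one-hot encoding is another common choice for categorical features.

```python
import numpy as np
from sklearn.feature_selection import chi2, mutual_info_classif
from sklearn.preprocessing import OrdinalEncoder

# Synthetic stand-in for the real data (assumption): 30 categorical
# features with 4 levels each; only features 0 and 1 drive the target.
rng = np.random.default_rng(0)
X_raw = rng.integers(0, 4, size=(2000, 30))
y = (X_raw[:, 0] + X_raw[:, 1] > 3).astype(int)

# Encode categories as non-negative integer codes (assumed encoding).
X = OrdinalEncoder().fit_transform(X_raw)

# Chi-squared: returns one statistic and one p-value per feature.
chi2_stats, chi2_pvalues = chi2(X, y)

# Mutual information, treating every feature as discrete.
mi_scores = mutual_info_classif(X, y, discrete_features=True, random_state=0)

k = 10
chi2_top = np.argsort(chi2_pvalues)[:k]      # lowest p-values first
mi_top = np.argsort(mi_scores)[::-1][:k]     # highest MI scores first

# Features picked by both methods.
overlap = sorted(set(chi2_top.tolist()) & set(mi_top.tolist()))
print("chi2 top 10:", chi2_top.tolist())
print("MI top 10:  ", mi_top.tolist())
print("overlap:    ", overlap)
```

The overlap only shows where the two rankings agree; it does not say which set of features is better.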
Now, how do I compare the performance of these two methods and decide which one is better?