can we trust feature importance of a poor model

Question

Usually when training ensemble learning algorithms or a regression model we calculate feature importance and make conclusion that feature with highest feature importance has largest affect on y-variable.

No doubt when model itself if reliable(High MSE or accuracy) however what if model is not performing well? Can we still trust feature importance?

You can trust it as much as you trust the predictive accuracy. — user2974951, Oct 20 '21 at 07:36

Tim · Accepted Answer · 2021-10-20T12:27:17.873

3

The key consideration is that feature importance is always an importance for the particular model and the particular dataset. There is no such a thing as global importance. If you use different feature selection algorithms, you can expect that they would pick different features, hence consider different features to be important.

So if you have a poorly performing model, than feature importance tells you that the feature is important for the model when it makes its (poor) predictions. It doesn't mean that there is no other, better performing model, that possibly assigns less or more importance to the feature.

edited Oct 20 '21 at 12:27

answered Oct 20 '21 at 09:32

Tim

108,699
20
212
390

Thanks! Is there reference that proves your statement? – haneulkim Oct 21 '21 at 02:40
@haneulkim you have results of an experiment in the linked thread, for importance in general, it is calculated using some model, so it cannot be independent of the underlying model. – Tim Oct 21 '21 at 05:33

can we trust feature importance of a poor model

1 Answers1

Linked