0

I have used two regression models from ML for a specific problem. Now I want to study some features importance using the the partial dependence plot "PDP". My question is: do I need to obtain a PDP for each regression model? or I will get similar plots?

Thanks

jojo
  • 153
  • 4

1 Answers1

1

I'm assuming by PDP you refer to "Partial dependence plots". If so:

  1. You should expect to get straight lines as the models are linear
  2. If the models contain different variables then you should expect to get different line slopes for the same variables
  3. PDP is usually not the best tool for feature importance study. In the case of linear regression you would probably want to either look at the variable coefficient P-value or measure the reduction in accuracy measure associated with the removal of each variable. You can read some more about feature importance measures here
Iyar Lin
  • 141
  • 3
  • Yes indeed, I have used ANN, DNN, SVR, RF to solve two nonlinear regression problems. and I have added a new feature (the same) for each problem, the accuracy for the first problem gets better, where for the second problem it does not change. So I thought to used PDP to study the dependency plot of this feature for both problem to try to understand its importance. – jojo Oct 08 '21 at 10:46