I come accross the relaimpo package in R with the hope of using it to assess the importance of the regressors in a linear regression problem. I am interested in understanding how to relate the output of the relaimpo, e.g. lmg method to the coeficients of the models displayed as summary(model). I was reading the info in the package relaimpo and also followed this link Importance of predictors in multiple regression: Partial $R^2$ vs. standardized coefficients.
In the package for instance, there is an example where only numerical regressors are used:
One can see that the regressor Examination is negative and not significant. The author runs relaimpo with lmg and the output is below:
I thought the output of relaimpo is used to rank the regressor importance to the output, namelly: Education (1), Examination(2), InfantMortality (3), Catholic(4), Agriculture (5). Not sure I get it right because here Examination seems to have an important role (second). I know the output is only positive as they will explain the variance.
My questions:
- What are the most influential regressors in this case?
- Could one say that the Examination regressor has indeed an important impact and it is NEGATIVE?
- supposing I use also categorical regressors, and I got a rank from relaimpo. does this mean that each level is important?
- Should I use the ranking from relaimpo as a global assesment and then compute the effect (how?) for those regressors which come at teh first place (supposing I have more than 40 and I'm interested only on the first top ten)
Thank you