2

Is there any role of redundancy analysis (for example using the redun() function of the Hmisc package in R) in finding variables to be included for a regression model? It seems logical but are there any points to be kept in mind?

Scortchi - Reinstate Monica
  • 27,560
  • 8
  • 81
  • 248
rnso
  • 8,893
  • 14
  • 50
  • 94

1 Answers1

2

It's extremely useful, especially as it allows for non-monotonic relationships between predictors, but the caveats given in Low variance components in PCA, are they really just noise? Is there any way to test for it? & Examples of PCA where PCs with low variance are “useful” apply—the assumption that if two candidate predictors contain much the same information the difference between them is unimportant for explaining the response is one that has to be judged using subject-matter knowledge.

Scortchi - Reinstate Monica
  • 27,560
  • 8
  • 81
  • 248