5

Should I be concerned about multicollinearity when building a KNN model for a classification problem?

If yes, how should I deal with it?

Peter Smith

1 Answer

5

Clustering analysis is not negatively affected by heteroscedasticity, but its results are negatively affected by multicollinearity among the features/variables used, because correlated features carry more weight in the distance calculation than intended. KNN relies on the same kind of distance calculation, so the same concern applies to it.
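To make the "extra weight" point concrete, here is a minimal sketch in plain Python. The points and the helper names (`euclidean`, `with_duplicate`) are made up for illustration; the duplicated column stands in for a perfectly collinear feature.

```python
import math

def euclidean(a, b):
    """Plain Euclidean distance between two equal-length points."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def with_duplicate(p):
    """Append a copy of feature 0, i.e. a perfectly collinear extra feature."""
    return p + (p[0],)

# Hypothetical toy data with two features.
query = (0.0, 0.0)
p1 = (1.0, 0.0)  # differs from the query only in feature 0
p2 = (0.0, 1.2)  # differs from the query only in feature 1

# With the original features, p1 is the nearer neighbour.
d1, d2 = euclidean(query, p1), euclidean(query, p2)

# After adding the collinear copy, the feature 0 difference is counted twice,
# so p1 moves further away while p2 stays where it was.
d1_dup = euclidean(with_duplicate(query), with_duplicate(p1))
d2_dup = euclidean(with_duplicate(query), with_duplicate(p2))

print(f"original features:   d(p1)={d1:.3f}  d(p2)={d2:.3f}")
print(f"with collinear copy: d(p1)={d1_dup:.3f}  d(p2)={d2_dup:.3f}")
```

In this toy case the nearest neighbour changes from `p1` to `p2` purely because one feature was counted twice, which is the effect described above.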

Solution: Do not drop any variables. Instead, apply principal component analysis (PCA) and build the model on the resulting principal components.
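A minimal sketch of this approach, assuming scikit-learn and NumPy are available. The synthetic data, the 95% variance threshold, and `n_neighbors=5` are illustrative choices, not recommendations from this answer.

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.neighbors import KNeighborsClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Hypothetical data: the first two columns are nearly collinear.
rng = np.random.default_rng(0)
n = 200
signal = rng.normal(size=n)
other = rng.normal(size=n)
X = np.column_stack([signal, signal + 0.01 * rng.normal(size=n), other])
y = (signal + other > 0).astype(int)

# Scale, rotate onto principal components, then classify with KNN.
# n_components=0.95 keeps enough components to explain 95% of the variance
# (an assumed threshold; tune it for your own data).
model = make_pipeline(
    StandardScaler(),
    PCA(n_components=0.95),
    KNeighborsClassifier(n_neighbors=5),
)
model.fit(X, y)

# The near-duplicate pair should collapse largely into a single component.
n_components = model.named_steps["pca"].n_components_
print("components kept:", n_components)
print("training accuracy:", model.score(X, y))
```

Putting the steps in one pipeline means the scaling and the PCA rotation are fitted on the training data only and then reapplied to new points at prediction time.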

sycamore24