Should I concern about multicollinearity when building KNN model for the classification problem?
If yes. how to deal with it?
Should I concern about multicollinearity when building KNN model for the classification problem?
If yes. how to deal with it?
Clustering analysis is not negatively affected by heteroscedasticity but the results are negatively impacted by multicollinearity of features/ variables used in clustering as the correlated feature/ variable will carry extra weight on the distance calculation than desired.
Solution: Do not drop any variables, but you can use principal component analysis and consider principal component analysis results.