Are principal components best fitting lines?

Asked Oct 19 '21 at 10:00

Active Oct 19 '21 at 10:00

Viewed 45 times

The beginning of the Wikipedia article on PCA seems completely wrong to me (my italics):

“The principal components of a collection of points in a real coordinate space are a sequence of p unit vectors, where the i-th vector is the direction of a line that best fits the data while being orthogonal to the first i-1 vectors. Here, a best-fitting line is defined as one that minimizes the average squared distance from the points to the line.”

This sounds like regression but with PCA principal components are chosen that maximize the variance of the transformed data; no line is being “fit.” Am I missing something?

asked Oct 19 '21 at 10:00

Jon

2

Very much relevant: [Making sense of principal component analysis, eigenvectors & eigenvalues](https://stats.stackexchange.com/q/2691/1352) – Stephan Kolassa Oct 19 '21 at 10:05

Are principal components best fitting lines?

0 Answers0