There is a cluster criterion defined as:
$$\mathcal{C} = \operatorname{tr}(S_W^{-1}S_B) = \sum_{i=1}^d \lambda_i,$$
where $\operatorname{tr}$ is the trace, $S_W$ is the pooled within-group scatter matrix, and $S_B$ is the between-group scatter matrix; $d$ is the number of features (or dimensions of the scatter matrices). I have two sources for it, one here (eq 103) and the other here (p.22).
The R package clusterCrit
computes this as the matrix inverse of $S_W$ (I compared their output with a "manual" calculation), but the slides, based on the book by Duda, call this "the ratio of between to within-cluster scatter in the direction of eigenvectors".
My questions:
Is it the matrix inverse or a division? I find a similar term in linear discriminant analysis which is clearly a division.EDIT: Clearly this is the matrix inverse (no "matrix division").In the direction of which eigenvectors? What does that direction represent?What do the eigenvectors of $S_W^{-1}S_B$ represent?If this is the matrix inverse, thenI understand that $S_W^{-1}$ can be interpreted as a precision matrix. What is the intuitive interpretation of the trace of $S_W^{-1}S_B$?