From this post:
$ w=S_{W}^{-1}(μ1−μ2), $
is used to estimate
$w_{0}=\frac{1}{2}(μ_{1}−μ_{2})^{T}S_{W}^{-1}(μ_{1}−μ_{2})−log(\frac{P1}{P2}),$
However, this is for a situation where there are only 2 classes. How can i adapt this so it can be applied to n number of classes (say 3 classes).
On Wikipedia, it's mentioned that:
Another common method is pairwise classification, where a new classifier is created for each pair of classes (giving C(C − 1)/2 classifiers in total), with the individual classifiers combined to produce a final classification.
But, how is this expressed in a similar way/formula to the above? Thanks.