In many modern data, the number of variables is much higher than the number of observations and the within-group scatter matrix is singular. Then, the Fisher's linear discriminant analysis (LDA) cannot be applied. The work considers a way to circumvent this problem by doing principal component analysis (PCA) enhanced with additional discriminating features. Two approaches are proposed: the original PCs are rotated to maximize the Fisher's LDA criterion, and second, penalized PCs are produced to achieve simultaneous dimension reduction and maximization of the Fisher's LDA criterion. Both approaches are illustrated and compared to other existing methods on several well known data sets.

Constraint principal components for linear discrimination

N. Trendafilov
;
M. Gallo;V. Simonacci;
2023-01-01

Abstract

In many modern data, the number of variables is much higher than the number of observations and the within-group scatter matrix is singular. Then, the Fisher's linear discriminant analysis (LDA) cannot be applied. The work considers a way to circumvent this problem by doing principal component analysis (PCA) enhanced with additional discriminating features. Two approaches are proposed: the original PCs are rotated to maximize the Fisher's LDA criterion, and second, penalized PCs are produced to achieve simultaneous dimension reduction and maximization of the Fisher's LDA criterion. Both approaches are illustrated and compared to other existing methods on several well known data sets.
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11574/230106
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
social impact