Zheng Wenming, Lu Cheng, Lin Zhouchen, Zhang Tong, Cui Zhen, Yang Wankou
IEEE Trans Neural Netw Learn Syst. 2019 Oct;30(10):2898-2915. doi: 10.1109/TNNLS.2018.2863264. Epub 2018 Aug 29.
Fisher's criterion is one of the most popular discriminant criteria for feature extraction. It is defined as the generalized Rayleigh quotient of the between-class scatter distance to the within-class scatter distance. Consequently, Fisher's criterion does not take advantage of the discriminant information in the class covariance differences, and hence, its discriminant ability largely depends on the class mean differences. If the class mean distances are relatively large compared with the within-class scatter distance, Fisher's criterion-based discriminant analysis methods may achieve a good discriminant performance. Otherwise, it may not deliver good results. Moreover, we observe that the between-class distance of Fisher's criterion is based on the l -norm, which would be disadvantageous to separate the classes with smaller class mean distances. To overcome the drawback of Fisher's criterion, in this paper, we first derive a new discriminant criterion, expressed as a mixture of absolute generalized Rayleigh quotients, based on a Bayes error upper bound estimation, where mixture of Gaussians is adopted to approximate the real distribution of data samples. Then, the criterion is further modified by replacing l -norm with l one to better describe the between-class scatter distance, such that it would be more effective to separate the different classes. Moreover, we propose a novel l -norm heteroscedastic discriminant analysis method based on the new discriminant analysis (L1-HDA/GM) for heteroscedastic feature extraction, in which the optimization problem of L1-HDA/GM can be efficiently solved by using the eigenvalue decomposition approach. Finally, we conduct extensive experiments on four real data sets and demonstrate that the proposed method achieves much competitive results compared with the state-of-the-art methods.
费希尔准则是特征提取中最常用的判别准则之一。它被定义为类间散度距离与类内散度距离的广义瑞利商。因此,费希尔准则没有利用类协方差差异中的判别信息,其判别能力在很大程度上取决于类均值差异。如果类均值距离与类内散度距离相比相对较大,基于费希尔准则的判别分析方法可能会取得较好的判别性能。否则,可能无法得到好的结果。此外,我们观察到费希尔准则的类间距离基于l范数,这对于分离类均值距离较小的类是不利的。为了克服费希尔准则的缺点,在本文中,我们首先基于贝叶斯误差上界估计推导了一种新的判别准则,将其表示为绝对广义瑞利商的混合形式,其中采用高斯混合来近似数据样本的真实分布。然后,通过将l范数替换为l1范数对该准则进行进一步修改,以更好地描述类间散度距离,从而在分离不同类时更有效。此外,我们提出了一种基于新判别分析的新颖的l范数异方差判别分析方法(L1-HDA/GM)用于异方差特征提取,其中L1-HDA/GM的优化问题可以通过特征值分解方法有效地求解。最后,我们在四个真实数据集上进行了广泛的实验,并证明与现有方法相比,所提出的方法取得了更具竞争力的结果。