Padma Shri T K, Sriraam N
Department of Electronics and Communication, Manipal Institute of Technology, Manipal University, Manipal, Karnataka, 576104, India.
Department of Medical Electronics, M.S. Ramaiah Institute of Technology (An Autonomous Institute, Affiliated to Visvesvaraya Technological University), Bangalore, Karnataka, 560054, India.
Brain Inform. 2017 Jun;4(2):147-158. doi: 10.1007/s40708-017-0061-y. Epub 2017 Jan 21.
This paper presents a novel ranking method to select spectral entropy (SE) features that discriminate alcoholic and control visual event-related potentials (ERP'S) in gamma sub-band (30-55 Hz) derived from a 64-channel electroencephalogram (EEG) recording. The ranking is based on a t test statistic that rejects the null hypothesis that the group means of SE values in alcoholics and controls are identical. The SE features with high ranks are indicative of maximal separation between their group means. Various sizes of top ranked feature subsets are evaluated by applying principal component analysis (PCA) and k-nearest neighbor (k-NN) classification. Even though ranking does not influence the performance of classifier significantly with the selection of all 61 active channels, the classification efficiency is directly proportional to the number of principal components (pc). The effect of ranking and PCA on classification is predominantly observed with reduced feature subsets of (N = 25, 15) top ranked features. Results indicate that for N = 25, proposed ranking method improves the k-NN classification accuracy from 91 to 93.87% as the number of pcs increases from 5 to 25. With same number of pcs, the k-NN classifier responds with accuracies of 84.42-91.54% with non-ranked features. Similarly for N = 15 and number of pcs varying from 5 to 15, ranking enhances k-NN detection accuracies from 88.9 to 93.08% as compared to 86.75-91.96% without ranking. This shows that the detection accuracy is increased by 6.5 and 2.8%, respectively, for N = 25, whereas it enhances by 2.2 and 1%, respectively, for N = 15 in comparison with non-ranked features. In the proposed t test ranking method for feature selection, the pcs of only top ranked feature candidates take part in classification process and hence provide better generalization.
本文提出了一种新颖的排序方法,用于从64通道脑电图(EEG)记录中提取的伽马子带(30 - 55Hz)中选择能够区分酒精成瘾者和对照者视觉事件相关电位(ERP)的频谱熵(SE)特征。该排序基于t检验统计量,用于拒绝酒精成瘾者和对照者SE值的组均值相同的原假设。高排名的SE特征表明其组均值之间的最大分离度。通过应用主成分分析(PCA)和k近邻(k-NN)分类对各种大小的顶级特征子集进行评估。尽管在选择所有61个活动通道时,排序对分类器的性能没有显著影响,但分类效率与主成分(pc)的数量成正比。排序和PCA对分类的影响主要在(N = 25, 15)顶级排名特征的减少特征子集上观察到。结果表明,对于N = 25,随着pc数量从5增加到25,所提出的排序方法将k-NN分类准确率从91%提高到93.87%。在相同数量的pc下,k-NN分类器对未排序特征的响应准确率为84.42 - 91.54%。同样,对于N = 15且pc数量从5变化到15,与未排序相比,排序将k-NN检测准确率从88.9%提高到93.08%,而未排序时为86.75 - 91.96%。这表明,与未排序特征相比,对于N = 25,检测准确率分别提高了6.5%和2.8%,而对于N = 15,检测准确率分别提高了2.2%和1%。在所提出的用于特征选择的t检验排序方法中,只有顶级排名特征候选的pcs参与分类过程,因此提供了更好的泛化能力。