Sheikholeslami N, Stashuk D
Department of Electrical Engineering, McGill University, Montreal, Quebec, Canada.
Med Biol Eng Comput. 1997 Nov;35(6):661-70. doi: 10.1007/BF02510975.
A new supervised mutual information-based feature selection method is presented. Using real motor unit action potential (MUAP) data from 10 EMG signals, the performances of 32 time-sample feature sets, feature subsets selected using first- and second-order mutual information and features obtained using linear discriminant analysis (LDA) and principal component analysis (PCA) were evaluated using a minimum Euclidean distance (MED) classifier. The evaluation showed that by using only 20 first-order features or only 15 second-order features mean error rates and error rate variations equivalent to using all 32 samples or LDA or PCA could be obtained. The computational cost of first-order feature selection was considerably less than LDA, PCA and second-order feature selection. The performance of first-order features was further evaluated using a more robust classifier. Unlike the MED classifier, the robust classifier only assigned a candidate MUAP if the assignment was sufficiently certain. For the robust classifier the average error rates using 20 features were similar to using the full feature set, yet higher assignment rates were obtained. Results from both evaluations suggest that the sets of first-order features were an efficient representation of lower dimension, which provided high accuracy classification with reduced computational requirements.
提出了一种基于监督互信息的新特征选择方法。使用来自10个肌电图(EMG)信号的真实运动单位动作电位(MUAP)数据,利用最小欧几里得距离(MED)分类器评估了32个时间样本特征集、使用一阶和二阶互信息选择的特征子集以及使用线性判别分析(LDA)和主成分分析(PCA)获得的特征的性能。评估表明,仅使用20个一阶特征或仅15个二阶特征,就可以获得与使用所有32个样本或LDA或PCA相当的平均错误率和错误率变化。一阶特征选择的计算成本远低于LDA、PCA和二阶特征选择。使用更强大的分类器进一步评估了一阶特征的性能。与MED分类器不同,强大的分类器仅在分配足够确定时才分配候选MUAP。对于强大的分类器,使用20个特征的平均错误率与使用完整特征集相似,但获得了更高的分配率。两次评估的结果都表明,一阶特征集是低维的有效表示,它在降低计算需求的情况下提供了高精度分类。