Wang Shuaifang, Li Yan, Wen Peng, Lai David
Faculty of Health, Engineering and Sciences, University of Southern Queensland, Toowoomba, QLD, 4350, Australia.
Australas Phys Eng Sci Med. 2016 Mar;39(1):157-65. doi: 10.1007/s13246-015-0414-x. Epub 2016 Jan 5.
The alcoholism can be detected by analyzing electroencephalogram (EEG) signals. However, analyzing multi-channel EEG signals is a challenging task, which often requires complicated calculations and long execution time. This paper proposes three data selection methods to extract representative data from the EEG signals of alcoholics. The methods are the principal component analysis based on graph entropy (PCA-GE), the channel selection based on graph entropy (GE) difference, and the mathematic combinations channel selection, respectively. For comparison purposes, the selected data from the three methods are then classified by three classifiers: the J48 decision tree, the K-nearest neighbor and the Kstar, separately. The experimental results show that the proposed methods are successful in selecting data without compromising the classification accuracy in discriminating the EEG signals from alcoholics and non-alcoholics. Among them, the proposed PCA-GE method uses only 29.69% of the whole data and 29.5% of the computation time but achieves a 94.5% classification accuracy. The channel selection method based on the GE difference also gains a 91.67% classification accuracy by using only 29.69% of the full size of the original data. Using as little data as possible without sacrificing the final classification accuracy is useful for online EEG analysis and classification application design.
酒精中毒可以通过分析脑电图(EEG)信号来检测。然而,分析多通道EEG信号是一项具有挑战性的任务,通常需要复杂的计算和较长的执行时间。本文提出了三种数据选择方法,从酗酒者的EEG信号中提取代表性数据。这些方法分别是基于图熵的主成分分析(PCA-GE)、基于图熵(GE)差异的通道选择以及数学组合通道选择。为了进行比较,然后分别使用三种分类器对从这三种方法中选择的数据进行分类:J48决策树、K近邻和Kstar。实验结果表明,所提出的方法在选择数据时能够成功地在不影响区分酗酒者和非酗酒者的EEG信号分类准确率的情况下进行。其中,所提出的PCA-GE方法仅使用了全部数据的29.69%和29.5%的计算时间,但实现了94.5%的分类准确率。基于GE差异的通道选择方法通过仅使用原始数据全尺寸的29.69%也获得了91.67%的分类准确率。在不牺牲最终分类准确率的情况下尽可能少地使用数据,对于在线EEG分析和分类应用设计是有用的。