Wang Weibo, Li Junwen, Fang Yu, Zheng Yongkang, You Fang
School of Electrical and Electronic Information, Xihua University, Chengdu 610039, People's Republic of China.
State Grid Sichuan Electric Power Research Institute, Chengdu 610072, People's Republic of China.
Physiol Meas. 2023 Oct 31;44(10). doi: 10.1088/1361-6579/acff35.
. Sleep staging is the basis for sleep quality assessment and diagnosis of sleep-related disorders. In response to the inadequacy of traditional manual judgement of sleep stages, using machine learning techniques for automatic sleep staging has become a hot topic. To improve the performance of sleep staging, numerous studies have extracted a large number of sleep-related characteristics. However, there are redundant and irrelevant features in the high-dimensional features that reduce the classification accuracy. To address this issue, an effective hybrid feature selection method based on the entropy weight method is proposed in this paper for automatic sleep staging.. Firstly, we preprocess the four modal polysomnography (PSG) signals, including electroencephalogram (EEG), electrooculogram (EOG), electrocardiogram (ECG) and electromyogram (EMG). Secondly, the time domain, frequency domain and nonlinear features are extracted from the preprocessed signals, with a total of 185 features. Then, in order to acquire characteristics of the multi-modal signals that are highly correlated with the sleep stages, the proposed hybrid feature selection method is applied to choose effective features. This method is divided into two stages. In stage I, the entropy weight method is employed to combine two filter methods to build a subset of features. This stage evaluates features based on information theory and distance metrics, which can quickly obtain a subset of features and retain the relevant features. In stage II, Sequential Forward Selection is used to evaluate the subset of features and eliminate redundant features. Further more, to achieve better performance of classification, an ensemble model based on support vector machine, K-nearest neighbor, random forest and multilayer perceptron is finally constructed for classifying sleep stages.. The experiment using the Cyclic Alternating Pattern (CAP) sleep database is performed to assess the performance of the method proposed in this paper. The proposed hybrid feature selection method chooses only 30 features highly correlated to sleep stages. The accuracy, F1 score and Kappa coefficient of 6 class sleep staging reach 88.86%, 83.15% and 0.8531%, respectively.. Experimental results show the effectiveness of the proposed method compared to the existing state-of-the-art studies. It greatly reduces the number of features required while achieving outstanding auto-sleep staging results.
睡眠分期是睡眠质量评估和睡眠相关障碍诊断的基础。针对传统人工判断睡眠分期的不足,利用机器学习技术进行自动睡眠分期已成为一个热门话题。为了提高睡眠分期的性能,众多研究提取了大量与睡眠相关的特征。然而,高维特征中存在冗余和不相关的特征,降低了分类准确率。为了解决这个问题,本文提出了一种基于熵权法的有效混合特征选择方法用于自动睡眠分期。首先,我们对包括脑电图(EEG)、眼电图(EOG)、心电图(ECG)和肌电图(EMG)在内的四种多导睡眠图(PSG)信号进行预处理。其次,从预处理后的信号中提取时域、频域和非线性特征,共185个特征。然后,为了获取与睡眠分期高度相关的多模态信号特征,应用所提出的混合特征选择方法选择有效特征。该方法分为两个阶段。在第一阶段,采用熵权法结合两种滤波方法构建特征子集。此阶段基于信息论和距离度量对特征进行评估,能够快速获得特征子集并保留相关特征。在第二阶段,使用顺序前向选择对特征子集进行评估并消除冗余特征。此外,为了实现更好的分类性能,最终构建了一个基于支持向量机、K近邻、随机森林和多层感知器的集成模型用于睡眠分期分类。使用循环交替模式(CAP)睡眠数据库进行实验,以评估本文提出的方法的性能。所提出的混合特征选择方法仅选择了30个与睡眠分期高度相关的特征。6类睡眠分期的准确率、F1分数和卡帕系数分别达到88.86%、83.15%和0.8531%。实验结果表明,与现有最先进的研究相比,本文提出的方法是有效的。它在实现出色的自动睡眠分期结果的同时,大大减少了所需的特征数量。