Institute for Innovative Learning, Mahidol University, Thailand.
Department of Biomedical Engineering, Faculty of Engineering, Mahidol University, Thailand.
Comput Biol Med. 2024 Nov;182:109138. doi: 10.1016/j.compbiomed.2024.109138. Epub 2024 Sep 20.
Numerous automatic sleep stage classification systems have been developed, but none have become effective assistive tools for sleep technicians due to issues with generalization. Four key factors hinder the generalization of these models are instruments, montage of recording, subject type, and scoring manual factors. This study aimed to develop a deep learning model that addresses generalization problems by integrating enzyme-inspired specificity and employing separating training approaches. Subject type and scoring manual factors were controlled, while the focus was on instruments and montage of recording factors. The proposed model consists of three sets of signal-specific models including EEG-, EOG-, and EMG-specific model. The EEG-specific models further include three sets of channel-specific models. All signal-specific and channel-specific models were established with data manipulation and weighted loss strategies, resulting in three sets of data manipulation models and class-specific models, respectively. These models were CNNs. Additionally, BiLSTM models were applied to EEG- and EOG-specific models to obtain temporal information. Finally, classification task for sleep stage was handled by 'the-last-dense' layer. The optimal sampling frequency for each physiological signal was identified and used during the training process. The proposed model was trained on MGH dataset and evaluated using both within dataset and cross-dataset. For MGH dataset, overall accuracy of 81.05 %, MF1 of 79.05 %, Kappa of 0.7408, and per-class F1-scores: W (84.98 %), N1 (58.06 %), N2 (84.82 %), N3 (79.20 %), and REM (88.17 %) can be achieved. Performances on cross-datasets are as follows: SHHS1 200 records reached 79.54 %, 70.56 %, and 0.7078; SHHS2 200 records achieved 76.77 %, 66.30 %, and 0.6632; Sleep-EDF 153 records gained 78.52 %, 72.13 %, and 0.7031; and BCI-MU (local dataset) 94 records achieved 83.57 %, 82.17 %, and 0.7769 for overall accuracy, MF1, and Kappa respectively. Additionally, the proposed model has approximately 9.3 M trainable parameters and takes around 26 s to process one PSG record. The results indicate that the proposed model demonstrates generalizability in sleep stage classification and shows potential as a feasibility tool for real-world applications. Additionally, enzyme-inspired specificity effectively addresses the challenges posed by varying montage of recording, while the identified optimal frequencies mitigate instrument-related issues.
已经开发出许多自动睡眠阶段分类系统,但由于泛化问题,没有一个成为睡眠技术人员有效的辅助工具。有四个关键因素阻碍了这些模型的推广:仪器、记录的导联、受试类型和评分手册因素。本研究旨在开发一种深度学习模型,通过集成酶启发的特异性和采用分离的训练方法来解决泛化问题。控制受试类型和评分手册因素,重点关注仪器和记录导联因素。所提出的模型由三组信号特异性模型组成,包括 EEG-、EOG- 和 EMG-特异性模型。EEG-特异性模型进一步包括三组通道特异性模型。所有信号特异性和通道特异性模型均通过数据操作和加权损失策略建立,分别得到三组数据操作模型和类特异性模型。这些模型是 CNN。此外,BiLSTM 模型应用于 EEG 和 EOG 特异性模型以获取时间信息。最后,通过 'the-last-dense' 层处理睡眠阶段的分类任务。确定了每个生理信号的最佳采样频率,并在训练过程中使用。所提出的模型在 MGH 数据集上进行训练,并在内部数据集和交叉数据集上进行评估。对于 MGH 数据集,整体准确率为 81.05%,MF1 为 79.05%,Kappa 为 0.7408,每个类别的 F1 得分为:W(84.98%)、N1(58.06%)、N2(84.82%)、N3(79.20%)和 REM(88.17%)。交叉数据集上的性能如下:SHHS1 200 条记录达到 79.54%、70.56%和 0.7078;SHHS2 200 条记录达到 76.77%、66.30%和 0.6632;Sleep-EDF 153 条记录达到 78.52%、72.13%和 0.7031;BCI-MU(本地数据集)94 条记录达到整体准确率 83.57%、MF1 82.17%和 Kappa 7769%。此外,所提出的模型具有约 9.3M 个可训练参数,处理一个 PSG 记录大约需要 26 秒。结果表明,所提出的模型在睡眠阶段分类中表现出可泛化性,并显示出作为现实应用可行性工具的潜力。此外,酶启发的特异性有效地解决了记录导联变化带来的挑战,而确定的最佳频率则减轻了仪器相关问题。