Department of Sleep Medicine, Institute of Respiratory Diseases, Shenzhen People's Hospital, The Second Clinical Medical College of Jinan University, The First Affiliated Hospital of Southern University of Science and Technology, Shenzhen, 518020, China.
Tsinghua Shenzhen International Graduate School, Tsinghua University, Shenzhen, 518055, China.
Comput Biol Med. 2024 Sep;179:108855. doi: 10.1016/j.compbiomed.2024.108855. Epub 2024 Jul 18.
To compare the accuracy and generalizability of an automated deep neural network and the Philip Sleepware G3™ Somnolyzer system (Somnolyzer) for sleep stage scoring using American Academy of Sleep Medicine (AASM) guidelines.
Sleep recordings from 104 participants were analyzed by a convolutional neural network (CNN), the Somnolyzer and skillful technicians. Evaluation metrics were derived for different combinations of sleep stages. A further comparison between the Somnolyzer and the CNN model using a single-channel signal as input was also performed. Sleep recordings from 263 participants with a lower prevalence of OSA served as a cross-validation dataset to validate the generalizability of the CNN model.
The overall agreement between automated and manual scoring for sleep staging in 104 participants outperformed that of the Somnolyzer according to various metrics (accuracy: 81.81 % vs. 77.07 %; F1: 76.36 % vs. 73.80 %; Cohen's kappa: 0.7403 vs. 0.6848). The results showed that the left electrooculography (EOG) single-channel model had minor advantages over the Somnolyzer. In terms of consistency with manual sleep staging, the CNN model demonstrated superior performance in identifying more pronounced sleep transitions, particularly in the N2 stage and sleep latency metrics. Conversely, the Somnolyzer showed enhanced proficiency in the analysis of REM stages, notably in measuring REM latency. The accuracy in the cross-validation set of 263 participants was also above 80 %.
The CNN-based automated deep neural network outperformed the Somnolyzer and is sufficiently accurate for sleep study analyses using the AASM classification criteria.
比较自动深度神经网络和 Philip Sleepware G3™ Somnolyzer 系统(Somnolyzer)在使用美国睡眠医学学会(AASM)指南进行睡眠分期时的准确性和泛化能力。
使用卷积神经网络(CNN)、Somnolyzer 和熟练技术人员对 104 名参与者的睡眠记录进行分析。评估指标是针对不同的睡眠阶段组合得出的。还使用单通道信号作为输入,对 Somnolyzer 和 CNN 模型进行了进一步比较。来自 263 名睡眠呼吸暂停低通气综合征(OSA)患病率较低的参与者的睡眠记录作为交叉验证数据集,用于验证 CNN 模型的泛化能力。
根据各种指标,104 名参与者中自动和手动睡眠分期的整体一致性优于 Somnolyzer(准确性:81.81% vs. 77.07%;F1:76.36% vs. 73.80%;Cohen's kappa:0.7403 vs. 0.6848)。结果表明,左侧眼电图(EOG)单通道模型优于 Somnolyzer。在与手动睡眠分期的一致性方面,CNN 模型在识别更明显的睡眠转移动作方面表现出更好的性能,特别是在 N2 阶段和睡眠潜伏期指标上。相比之下,Somnolyzer 在 REM 阶段的分析中表现出更高的熟练程度,尤其是在测量 REM 潜伏期方面。在 263 名参与者的交叉验证集中,准确率也高于 80%。
基于 CNN 的自动深度神经网络优于 Somnolyzer,对于使用 AASM 分类标准进行睡眠研究分析,其准确性足够高。