Jeong Jaemin, Yoon Wonhyuck, Lee Jeong-Gun, Kim Dongyoung, Woo Yunhee, Kim Dong-Kyu, Shin Hyun-Woo
Department of Computer Engineering, School of Software, Hallym University, Chuncheon, Republic of Korea.
OUaR LaB, Inc, Seoul, Republic of Korea.
Sleep. 2023 Dec 11;46(12). doi: 10.1093/sleep/zsad242.
Polysomnography (PSG) scoring is labor-intensive, subjective, and often ambiguous. Recently several deep learning (DL) models for automated sleep scoring have been developed, they are tied to a fixed amount of input channels and resolution. In this study, we constructed a standardized image-based PSG dataset in order to overcome the heterogeneity of raw signal data obtained from various PSG devices and various sleep laboratory environments.
All individually exported European data format files containing raw signals were converted into images with an annotation file, which contained the demographics, diagnoses, and sleep statistics. An image-based DL model for automatic sleep staging was developed, compared with a signal-based model, and validated in an external dataset.
We constructed 10253 image-based PSG datasets using a standardized format. Among these, 7745 diagnostic PSG data were used to develop our DL model. The DL model using the image dataset showed similar performance to the signal-based dataset for the same subject. The overall DL accuracy was greater than 80%, even with severe obstructive sleep apnea. Moreover, for the first time, we showed explainable DL in the field of sleep medicine as visualized key inference regions using Eigen-class activation maps. Furthermore, when a DL model for sleep scoring performs external validation, we achieved a relatively good performance.
Our main contribution demonstrates the availability of a standardized image-based dataset, and highlights that changing the data sampling rate or number of sensors may not require retraining, although performance decreases slightly as the number of sensors decreases.
多导睡眠图(PSG)评分工作强度大、主观性强且常常模棱两可。最近已开发出几种用于自动睡眠评分的深度学习(DL)模型,但它们与固定数量的输入通道和分辨率相关联。在本研究中,我们构建了一个标准化的基于图像的PSG数据集,以克服从各种PSG设备和各种睡眠实验室环境获得的原始信号数据的异质性。
所有单独导出的包含原始信号的欧洲数据格式文件都被转换为带有注释文件的图像,该注释文件包含人口统计学信息、诊断结果和睡眠统计数据。开发了一种基于图像的自动睡眠分期DL模型,与基于信号的模型进行比较,并在外部数据集中进行验证。
我们使用标准化格式构建了10253个基于图像的PSG数据集。其中,7745个诊断性PSG数据用于开发我们的DL模型。使用图像数据集的DL模型对同一受试者显示出与基于信号的数据集相似的性能。即使对于严重阻塞性睡眠呼吸暂停,DL的总体准确率也大于80%。此外,我们首次在睡眠医学领域展示了可解释的DL,即使用特征类激活图可视化关键推理区域。此外,当用于睡眠评分的DL模型进行外部验证时,我们取得了相对较好的性能。
我们的主要贡献证明了标准化的基于图像的数据集的可用性,并强调改变数据采样率或传感器数量可能不需要重新训练,尽管随着传感器数量的减少性能会略有下降。