IEEE Trans Neural Syst Rehabil Eng. 2021;29:2076-2085. doi: 10.1109/TNSRE.2021.3117970. Epub 2021 Oct 19.
Deep learning is widely used in the most recent automatic sleep scoring algorithms. Its popularity stems from its excellent performance and from its ability to process raw signals and to learn feature directly from the data. Most of the existing scoring algorithms exploit very computationally demanding architectures, due to their high number of training parameters, and process lengthy time sequences in input (up to 12 minutes). Only few of these architectures provide an estimate of the model uncertainty. In this study we propose DeepSleepNet-Lite, a simplified and lightweight scoring architecture, processing only 90-seconds EEG input sequences. We exploit, for the first time in sleep scoring, the Monte Carlo dropout technique to enhance the performance of the architecture and to also detect the uncertain instances. The evaluation is performed on a single-channel EEG Fpz-Cz from the open source Sleep-EDF expanded database. DeepSleepNet-Lite achieves slightly lower performance, if not on par, compared to the existing state-of-the-art architectures, in overall accuracy, macro F1-score and Cohen's kappa (on Sleep-EDF v1-2013 ±30mins: 84.0%, 78.0%, 0.78; on Sleep-EDF v2-2018 ±30mins: 80.3%, 75.2%, 0.73). Monte Carlo dropout enables the estimate of the uncertain predictions. By rejecting the uncertain instances, the model achieves higher performance on both versions of the database (on Sleep-EDF v1-2013 ±30mins: 86.1.0%, 79.6%, 0.81; on Sleep-EDF v2-2018 ±30mins: 82.3%, 76.7%, 0.76). Our lighter sleep scoring approach paves the way to the application of scoring algorithms for sleep analysis in real-time.
深度学习在最近的自动睡眠评分算法中得到了广泛应用。它的流行源于其出色的性能,以及直接从数据中处理原始信号和学习特征的能力。大多数现有的评分算法都利用了非常耗费计算资源的架构,这是由于它们的训练参数数量很多,并且需要处理输入的长时序列(长达 12 分钟)。这些架构中只有少数提供模型不确定性的估计。在这项研究中,我们提出了 DeepSleepNet-Lite,这是一种简化和轻量级的评分架构,只处理 90 秒的 EEG 输入序列。我们首次在睡眠评分中利用蒙特卡罗辍学技术来提高架构的性能,并检测不确定的实例。评估是在开放源代码的 Sleep-EDF 扩展数据库的单通道 EEG Fpz-Cz 上进行的。与现有的最先进的架构相比,DeepSleepNet-Lite 的整体准确性、宏观 F1 分数和 Cohen 的 kappa 略有下降(在 Sleep-EDF v1-2013 ±30 分钟时为 84.0%、78.0%和 0.78;在 Sleep-EDF v2-2018 ±30 分钟时为 80.3%、75.2%和 0.73)。蒙特卡罗辍学可以估计不确定的预测。通过拒绝不确定的实例,该模型在两个版本的数据库上都实现了更高的性能(在 Sleep-EDF v1-2013 ±30 分钟时为 86.1.0%、79.6%和 0.81;在 Sleep-EDF v2-2018 ±30 分钟时为 82.3%、76.7%和 0.76)。我们更轻量级的睡眠评分方法为实时睡眠分析中的评分算法的应用铺平了道路。