Rusanen Matias, Jouan Gabriel, Huttunen Riku, Nikkonen Sami, Sigurðardóttir Sigríður, Töyräs Juha, Duce Brett, Myllymaa Sami, Arnardottir Erna Sif, Leppänen Timo, Islind Anna Sigridur, Kainulainen Samu, Korkalainen Henri
Department of Technical Physics, University of Eastern Finland, Kuopio, Finland.
Diagnostic Imaging Centre, Kuopio University Hospital, Kuopio, Finland.
J Sleep Res. 2025 Jun;34(3):e14362. doi: 10.1111/jsr.14362. Epub 2024 Oct 23.
State-of-the-art automatic sleep staging methods have demonstrated comparable reliability and superior time efficiency to manual sleep staging. However, fully automatic black-box solutions are difficult to adapt into clinical workflow due to the lack of transparency in decision-making processes. Transparency would be crucial for interaction between automatic methods and the work of sleep experts, i.e., in human-in-the-loop applications. To address these challenges, we propose an automatic sleep staging model (aSAGA) that effectively utilises both electroencephalography and electro-oculography channels while incorporating transparency of uncertainty in the decision-making process. We validated the model through extensive retrospective testing using a range of datasets, including open-access, clinical, and research-driven sources. Our channel-wise ensemble model, trained on both electroencephalography and electro-oculography signals, demonstrated robustness and the ability to generalise across various types of sleep recordings, including novel self-applied home polysomnography. Additionally, we compared model uncertainty with human uncertainty in sleep staging and studied various uncertainty mapping metrics to identify ambiguous regions, or "grey areas", that may require manual re-evaluation. The validation of this grey area concept revealed its potential to enhance sleep staging accuracy and to highlight regions in the recordings where sleep experts may struggle to reach a consensus. In conclusion, this study provides a technical basis and understanding of automatic sleep staging uncertainty. Our approach has the potential to improve the integration of automatic sleep staging into clinical practice; however, further studies are needed to test the model prospectively in real-world clinical settings and human-in-the-loop scoring applications.
先进的自动睡眠分期方法已证明与手动睡眠分期具有相当的可靠性和更高的时间效率。然而,由于决策过程缺乏透明度,全自动黑箱解决方案难以应用于临床工作流程。透明度对于自动方法与睡眠专家的工作之间的交互至关重要,即在人在回路应用中。为应对这些挑战,我们提出了一种自动睡眠分期模型(aSAGA),该模型有效利用脑电图和眼电图通道,同时在决策过程中纳入不确定性的透明度。我们使用一系列数据集(包括开放获取、临床和研究驱动来源)通过广泛的回顾性测试对该模型进行了验证。我们基于脑电图和眼电图信号训练的通道级集成模型展示了鲁棒性以及在各种类型的睡眠记录(包括新型自我应用的家庭多导睡眠图)上进行泛化的能力。此外,我们比较了睡眠分期中模型不确定性与人类不确定性,并研究了各种不确定性映射指标以识别可能需要人工重新评估的模糊区域或“灰色区域”。对这一灰色区域概念的验证揭示了其提高睡眠分期准确性以及突出记录中睡眠专家可能难以达成共识的区域的潜力。总之,本研究为自动睡眠分期不确定性提供了技术基础和理解。我们的方法有可能改善自动睡眠分期在临床实践中的整合;然而,需要进一步的研究在实际临床环境和人在回路评分应用中对该模型进行前瞻性测试。