• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

两个欧洲睡眠中心的手动评分与基于人工智能的斯坦福-STAGES 算法的自动评分之间的睡眠分期评分者间可靠性。

Interrater sleep stage scoring reliability between manual scoring from two European sleep centers and automatic scoring performed by the artificial intelligence-based Stanford-STAGES algorithm.

机构信息

Department of Neurology, Medical University of Innsbruck, Innsbruck, Austria.

Interdisciplinary Sleep Medicine Center, Charité-Universitätsmedizin Berlin, Berlin, Germany.

出版信息

J Clin Sleep Med. 2021 Jun 1;17(6):1237-1247. doi: 10.5664/jcsm.9174.

DOI:10.5664/jcsm.9174
PMID:33599203
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8314654/
Abstract

STUDY OBJECTIVES

The objective of this study was to evaluate interrater reliability between manual sleep stage scoring performed in 2 European sleep centers and automatic sleep stage scoring performed by the previously validated artificial intelligence-based Stanford-STAGES algorithm.

METHODS

Full night polysomnographies of 1,066 participants were included. Sleep stages were manually scored in Berlin and Innsbruck sleep centers and automatically scored with the Stanford-STAGES algorithm. For each participant, we compared (1) Innsbruck to Berlin scorings (INN vs BER); (2) Innsbruck to automatic scorings (INN vs AUTO); (3) Berlin to automatic scorings (BER vs AUTO); (4) epochs where scorers from Innsbruck and Berlin had consensus to automatic scoring (CONS vs AUTO); and (5) both Innsbruck and Berlin manual scorings (MAN) to the automatic ones (MAN vs AUTO). Interrater reliability was evaluated with several measures, including overall and sleep stage-specific Cohen's κ.

RESULTS

Overall agreement across participants was substantial for INN vs BER (κ = 0.66 ± 0.13), INN vs AUTO (κ = 0.68 ± 0.14), CONS vs AUTO (κ = 0.73 ± 0.14), and MAN vs AUTO (κ = 0.61 ± 0.14), and moderate for BER vs AUTO (κ = 0.55 ± 0.15). Human scorers had the highest disagreement for N1 sleep (κ = 0.40 ± 0.16 for INN vs BER). Automatic scoring had lowest agreement with manual scorings for N1 and N3 sleep (κ = 0.25 ± 0.14 and κ = 0.42 ± 0.32 for MAN vs AUTO).

CONCLUSIONS

Interrater reliability for sleep stage scoring between human scorers was in line with previous findings, and the algorithm achieved an overall substantial agreement with manual scoring. In this cohort, the Stanford-STAGES algorithm showed similar performances to the ones achieved in the original study, suggesting that it is generalizable to new cohorts. Before its integration in clinical practice, future independent studies should further evaluate it in other cohorts.

摘要

研究目的

本研究旨在评估在两个欧洲睡眠中心进行的手动睡眠分期评分与之前经过验证的基于人工智能的斯坦福-STAGES 算法的自动睡眠分期评分之间的评分者间可靠性。

方法

共纳入 1066 名参与者的整夜多导睡眠图。在柏林和因斯布鲁克睡眠中心进行手动睡眠分期评分,并使用斯坦福-STAGES 算法进行自动评分。对于每个参与者,我们比较了(1)因斯布鲁克与柏林评分(INN 与 BER);(2)因斯布鲁克与自动评分(INN 与 AUTO);(3)柏林与自动评分(BER 与 AUTO);(4)因斯布鲁克和柏林评分者达成共识的自动评分(CONS 与 AUTO);以及(5)因斯布鲁克和柏林的手动评分(MAN)与自动评分(MAN 与 AUTO)。采用多种措施评估评分者间可靠性,包括总体和睡眠分期特异性 Cohen's κ。

结果

参与者之间的整体一致性对于 INN 与 BER(κ=0.66±0.13)、INN 与 AUTO(κ=0.68±0.14)、CONS 与 AUTO(κ=0.73±0.14)和 MAN 与 AUTO(κ=0.61±0.14)均为高度一致,对于 BER 与 AUTO(κ=0.55±0.15)则为中度一致。人类评分者对 N1 睡眠的分歧最大(INN 与 BER 的 κ=0.40±0.16)。自动评分与手动评分的一致性最低,N1 和 N3 睡眠的 κ 值分别为 0.25±0.14 和 0.42±0.32(MAN 与 AUTO)。

结论

人类评分者之间的睡眠分期评分的评分者间可靠性与之前的研究结果一致,该算法与手动评分总体上具有高度一致性。在本队列中,斯坦福-STAGES 算法的表现与原始研究中的表现相似,表明它可以推广到新的队列。在将其整合到临床实践之前,未来的独立研究应在其他队列中进一步评估它。

相似文献

1
Interrater sleep stage scoring reliability between manual scoring from two European sleep centers and automatic scoring performed by the artificial intelligence-based Stanford-STAGES algorithm.两个欧洲睡眠中心的手动评分与基于人工智能的斯坦福-STAGES 算法的自动评分之间的睡眠分期评分者间可靠性。
J Clin Sleep Med. 2021 Jun 1;17(6):1237-1247. doi: 10.5664/jcsm.9174.
2
Scoring sleep with artificial intelligence enables quantification of sleep stage ambiguity: hypnodensity based on multiple expert scorers and auto-scoring.人工智能评分可量化睡眠分期的不明确性:基于多位专家评分和自动评分的催眠密度。
Sleep. 2023 Feb 8;46(2). doi: 10.1093/sleep/zsac154.
3
Interrater agreement between American and Chinese sleep centers according to the 2014 AASM standard.中美睡眠中心之间根据 2014 年 AASM 标准的一致性。
Sleep Breath. 2019 Jun;23(2):719-728. doi: 10.1007/s11325-019-01801-x. Epub 2019 Feb 19.
4
Scoring accuracy of automated sleep staging from a bipolar electroocular recording compared to manual scoring by multiple raters.双极眼动记录的自动睡眠分期与多位评分者的手动评分的准确性比较。
Sleep Med. 2013 Nov;14(11):1199-207. doi: 10.1016/j.sleep.2013.04.022. Epub 2013 Aug 16.
5
Interrater reliability of sleep stage scoring: a meta-analysis.睡眠分期评分的组内信度:一项荟萃分析。
J Clin Sleep Med. 2022 Jan 1;18(1):193-202. doi: 10.5664/jcsm.9538.
6
Process and outcome for international reliability in sleep scoring.睡眠评分国际可靠性的过程与结果
Sleep Breath. 2015 Mar;19(1):191-5. doi: 10.1007/s11325-014-0990-0. Epub 2014 May 7.
7
Multicentre sleep-stage scoring agreement in the Sleep Revolution project.多中心睡眠分期评分在睡眠革命项目中的一致性。
J Sleep Res. 2024 Feb;33(1):e13956. doi: 10.1111/jsr.13956. Epub 2023 Jun 13.
8
The reliability and functional validity of visual and semiautomatic sleep/wake scoring in the Møll-Wistar rat.莫尔-威斯塔大鼠视觉及半自动睡眠/觉醒评分的可靠性和功能有效性
Sleep. 1994 Mar;17(2):120-31. doi: 10.1093/sleep/17.2.120.
9
Variability of sleep stage scoring in late midlife and early old age.中年晚期和老年早期睡眠阶段评分的变异性。
J Sleep Res. 2022 Feb;31(1):e13424. doi: 10.1111/jsr.13424. Epub 2021 Jun 24.
10
Staging Sleep in Polysomnograms: Analysis of Inter-Scorer Variability.多导睡眠图中的睡眠分期:评分者间变异性分析
J Clin Sleep Med. 2016 Jun 15;12(6):885-94. doi: 10.5664/jcsm.5894.

引用本文的文献

1
An important step toward automation of polysomnography analyses.多导睡眠图分析自动化的重要一步。
Sleep. 2025 Aug 14;48(8). doi: 10.1093/sleep/zsaf147.
2
Sleep as the Foundation of Brain Health.睡眠是大脑健康的基础。
Semin Neurol. 2025 May;45(3):305-316. doi: 10.1055/a-2566-4073. Epub 2025 Mar 26.
3
Semi automatic quantification of REM sleep without atonia in natural sleep environment.在自然睡眠环境中对无肌张力快速眼动睡眠进行半自动量化。
NPJ Digit Med. 2024 Nov 28;7(1):341. doi: 10.1038/s41746-024-01354-8.
4
Machine learning models including patient-reported outcome data in oncology: a systematic literature review and analysis of their reporting quality.机器学习模型在肿瘤学中纳入患者报告结局数据:系统文献回顾和报告质量分析。
J Patient Rep Outcomes. 2024 Nov 5;8(1):126. doi: 10.1186/s41687-024-00808-7.
5
A deep learning algorithm model to automatically score and grade obstructive sleep apnea in adult polysomnography.一种用于在成人多导睡眠图中自动对阻塞性睡眠呼吸暂停进行评分和分级的深度学习算法模型。
Digit Health. 2024 Oct 17;10:20552076241291707. doi: 10.1177/20552076241291707. eCollection 2024 Jan-Dec.
6
Machine Learning Predicts Phenoconversion from Polysomnography in Isolated REM Sleep Behavior Disorder.机器学习通过多导睡眠图预测孤立性快速眼动睡眠行为障碍中的表型转换。
Brain Sci. 2024 Aug 28;14(9):871. doi: 10.3390/brainsci14090871.
7
Temporal progression of sleep electroencephalography features in isolated rapid eye movement sleep behaviour disorder.孤立性快速眼动睡眠行为障碍中睡眠脑电图特征的时间进展
J Sleep Res. 2025 Jun;34(3):e14351. doi: 10.1111/jsr.14351. Epub 2024 Sep 25.
8
Artificial Intelligence Models for the Automation of Standard Diagnostics in Sleep Medicine-A Systematic Review.用于睡眠医学标准诊断自动化的人工智能模型——一项系统综述
Bioengineering (Basel). 2024 Feb 22;11(3):206. doi: 10.3390/bioengineering11030206.
9
Strengths, weaknesses, opportunities, and threats of using AI-enabled technology in sleep medicine: a commentary.利用人工智能技术在睡眠医学中的优势、劣势、机遇和威胁:述评。
J Clin Sleep Med. 2024 Jul 1;20(7):1183-1191. doi: 10.5664/jcsm.11132.
10
Evaluation of consensus sleep stage scoring of dysregulated sleep in Parkinson's disease.帕金森病睡眠失调的共识睡眠分期评估。
Sleep Med. 2023 Jul;107:236-242. doi: 10.1016/j.sleep.2023.04.031. Epub 2023 May 18.

本文引用的文献

1
Sleep modelled as a continuous and dynamic process predicts healthy ageing better than traditional sleep scoring.将睡眠建模为一个连续且动态的过程,比传统的睡眠评分更能预测健康衰老。
Sleep Med. 2021 Jan;77:136-146. doi: 10.1016/j.sleep.2020.11.033. Epub 2020 Dec 5.
2
Towards More Accurate Automatic Sleep Staging via Deep Transfer Learning.通过深度迁移学习实现更准确的自动睡眠分期。
IEEE Trans Biomed Eng. 2021 Jun;68(6):1787-1798. doi: 10.1109/TBME.2020.3020381. Epub 2021 May 21.
3
Expert-level automated sleep staging of long-term scalp electroencephalography recordings using deep learning.使用深度学习对长期头皮脑电图记录进行专家级别的自动睡眠分期。
Sleep. 2020 Nov 12;43(11). doi: 10.1093/sleep/zsaa112.
4
Reinventing polysomnography in the age of precision medicine.精准医学时代的睡眠监测技术革新。
Sleep Med Rev. 2020 Aug;52:101313. doi: 10.1016/j.smrv.2020.101313. Epub 2020 Mar 20.
5
Artificial intelligence in sleep medicine: background and implications for clinicians.睡眠医学中的人工智能:背景及对临床医生的启示
J Clin Sleep Med. 2020 Apr 15;16(4):609-618. doi: 10.5664/jcsm.8388.
6
Automated sleep scoring: A review of the latest approaches.自动睡眠评分:最新方法综述。
Sleep Med Rev. 2019 Dec;48:101204. doi: 10.1016/j.smrv.2019.07.007. Epub 2019 Aug 9.
7
Interrater agreement between American and Chinese sleep centers according to the 2014 AASM standard.中美睡眠中心之间根据 2014 年 AASM 标准的一致性。
Sleep Breath. 2019 Jun;23(2):719-728. doi: 10.1007/s11325-019-01801-x. Epub 2019 Feb 19.
8
Prevalence and associated risk factors of periodic limb movement in sleep in two German population-based studies.两项德国基于人群的研究中睡眠周期性肢体运动的患病率及相关危险因素。
Sleep. 2019 Mar 1;42(3). doi: 10.1093/sleep/zsy237.
9
Neural network analysis of sleep stages enables efficient diagnosis of narcolepsy.神经网络分析睡眠阶段有助于嗜睡症的高效诊断。
Nat Commun. 2018 Dec 6;9(1):5229. doi: 10.1038/s41467-018-07229-3.
10
Reliability of the American Academy of Sleep Medicine Rules for Assessing Sleep Depth in Clinical Practice.美国睡眠医学学会评估临床睡眠深度规则的可靠性。
J Clin Sleep Med. 2018 Feb 15;14(2):205-213. doi: 10.5664/jcsm.6934.