• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

人工智能评分可量化睡眠分期的不明确性:基于多位专家评分和自动评分的催眠密度。

Scoring sleep with artificial intelligence enables quantification of sleep stage ambiguity: hypnodensity based on multiple expert scorers and auto-scoring.

机构信息

Philips Sleep and Respiratory Care, Pittsburgh, PA,USA.

Philips Sleep and Respiratory Care, Vienna, Austria.

出版信息

Sleep. 2023 Feb 8;46(2). doi: 10.1093/sleep/zsac154.

DOI:10.1093/sleep/zsac154
PMID:35780449
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9905781/
Abstract

STUDY OBJECTIVES

To quantify the amount of sleep stage ambiguity across expert scorers and to validate a new auto-scoring platform against sleep staging performed by multiple scorers.

METHODS

We applied a new auto-scoring system to three datasets containing 95 PSGs scored by 6-12 scorers, to compare sleep stage probabilities (hypnodensity; i.e. the probability of each sleep stage being assigned to a given epoch) as the primary output, as well as a single sleep stage per epoch assigned by hierarchical majority rule.

RESULTS

The percentage of epochs with 100% agreement across scorers was 46 ± 9%, 38 ± 10% and 32 ± 9% for the datasets with 6, 9, and 12 scorers, respectively. The mean intra-class correlation coefficient between sleep stage probabilities from auto- and manual-scoring was 0.91, representing excellent reliability. Within each dataset, agreement between auto-scoring and consensus manual-scoring was significantly higher than agreement between manual-scoring and consensus manual-scoring (0.78 vs. 0.69; 0.74 vs. 0.67; and 0.75 vs. 0.67; all p < 0.01).

CONCLUSIONS

Analysis of scoring performed by multiple scorers reveals that sleep stage ambiguity is the rule rather than the exception. Probabilities of the sleep stages determined by artificial intelligence auto-scoring provide an excellent estimate of this ambiguity. Compared to consensus manual-scoring, sleep staging derived from auto-scoring is for each individual PSG noninferior to manual-scoring meaning that auto-scoring output is ready for interpretation without the need for manual adjustment.

摘要

研究目的

量化专家评分者之间的睡眠分期歧义程度,并验证一个新的自动评分平台与多位评分者进行的睡眠分期的准确性。

方法

我们应用一个新的自动评分系统对包含 6-12 位评分者的三个数据集的 95 次 PSG 进行评估,以比较睡眠分期概率(即每个睡眠分期被分配给特定时段的概率)作为主要输出,以及每个时段通过分层多数规则分配的单一睡眠分期。

结果

在有 6、9 和 12 位评分者的数据集上,评分者之间 100%一致的时段百分比分别为 46 ± 9%、38 ± 10%和 32 ± 9%。自动和手动评分的睡眠分期概率之间的平均组内相关系数为 0.91,代表极好的可靠性。在每个数据集内,自动评分与共识手动评分之间的一致性显著高于手动评分与共识手动评分之间的一致性(0.78 比 0.69;0.74 比 0.67;0.75 比 0.67;均 p < 0.01)。

结论

对多位评分者的评分分析表明,睡眠分期的歧义是常态而非异常。人工智能自动评分确定的睡眠分期概率提供了对这种歧义的极好估计。与共识手动评分相比,自动评分得出的睡眠分期对于每个单独的 PSG 来说并不逊于手动评分,这意味着自动评分结果无需手动调整即可进行解释。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c97d/9905781/d00b05b76b5b/zsac154f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c97d/9905781/e00861747cef/zsac154f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c97d/9905781/c461272721f7/zsac154f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c97d/9905781/42d523c2f507/zsac154f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c97d/9905781/fa137b32f132/zsac154f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c97d/9905781/d00b05b76b5b/zsac154f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c97d/9905781/e00861747cef/zsac154f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c97d/9905781/c461272721f7/zsac154f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c97d/9905781/42d523c2f507/zsac154f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c97d/9905781/fa137b32f132/zsac154f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c97d/9905781/d00b05b76b5b/zsac154f0005.jpg

相似文献

1
Scoring sleep with artificial intelligence enables quantification of sleep stage ambiguity: hypnodensity based on multiple expert scorers and auto-scoring.人工智能评分可量化睡眠分期的不明确性:基于多位专家评分和自动评分的催眠密度。
Sleep. 2023 Feb 8;46(2). doi: 10.1093/sleep/zsac154.
2
Interrater sleep stage scoring reliability between manual scoring from two European sleep centers and automatic scoring performed by the artificial intelligence-based Stanford-STAGES algorithm.两个欧洲睡眠中心的手动评分与基于人工智能的斯坦福-STAGES 算法的自动评分之间的睡眠分期评分者间可靠性。
J Clin Sleep Med. 2021 Jun 1;17(6):1237-1247. doi: 10.5664/jcsm.9174.
3
Staging Sleep in Polysomnograms: Analysis of Inter-Scorer Variability.多导睡眠图中的睡眠分期:评分者间变异性分析
J Clin Sleep Med. 2016 Jun 15;12(6):885-94. doi: 10.5664/jcsm.5894.
4
Minimizing Interrater Variability in Staging Sleep by Use of Computer-Derived Features.通过使用计算机衍生特征来最小化睡眠分期中的评分者间差异。
J Clin Sleep Med. 2016 Oct 15;12(10):1347-1356. doi: 10.5664/jcsm.6186.
5
Scoring accuracy of automated sleep staging from a bipolar electroocular recording compared to manual scoring by multiple raters.双极眼动记录的自动睡眠分期与多位评分者的手动评分的准确性比较。
Sleep Med. 2013 Nov;14(11):1199-207. doi: 10.1016/j.sleep.2013.04.022. Epub 2013 Aug 16.
6
An approach for determining the reliability of manual and digital scoring of sleep stages.一种用于确定睡眠阶段手动和数字评分可靠性的方法。
Sleep. 2023 Nov 8;46(11). doi: 10.1093/sleep/zsad248.
7
Multicentre sleep-stage scoring agreement in the Sleep Revolution project.多中心睡眠分期评分在睡眠革命项目中的一致性。
J Sleep Res. 2024 Feb;33(1):e13956. doi: 10.1111/jsr.13956. Epub 2023 Jun 13.
8
The American Academy of Sleep Medicine inter-scorer reliability program: sleep stage scoring.美国睡眠医学学会再评分者可靠性方案:睡眠分期。
J Clin Sleep Med. 2013 Jan 15;9(1):81-7. doi: 10.5664/jcsm.2350.
9
Polysomnography scoring-related training and quantitative assessment for improving interscorer agreement.多导睡眠图评分相关的培训和定量评估,以提高评分者间的一致性。
J Clin Sleep Med. 2024 Feb 1;20(2):271-278. doi: 10.5664/jcsm.10852.
10
Automated Scoring of Sleep and Associated Events.睡眠及相关事件的自动评分。
Adv Exp Med Biol. 2022;1384:107-130. doi: 10.1007/978-3-031-06413-5_7.

引用本文的文献

1
Validation of the Somnolyzer 24×7 automatic scoring system in children with suspected obstructive sleep apnea.Somnolyzer 24×7自动评分系统在疑似阻塞性睡眠呼吸暂停儿童中的验证
Front Med (Lausanne). 2025 Jun 18;12:1617530. doi: 10.3389/fmed.2025.1617530. eCollection 2025.
2
Beyond accuracy: a framework for evaluating algorithmic bias and performance, applied to automated sleep scoring.超越准确性:评估算法偏差与性能的框架,应用于自动睡眠评分
Sci Rep. 2025 Jul 1;15(1):21421. doi: 10.1038/s41598-025-06019-4.
3
Modeling of whole brain sleep electroencephalogram using deep oscillatory neural network.

本文引用的文献

1
A meta-analysis of the first-night effect in healthy individuals for the full age spectrum.对全年龄段健康个体首夜效应的荟萃分析。
Sleep Med. 2022 Jan;89:159-165. doi: 10.1016/j.sleep.2021.12.007. Epub 2021 Dec 17.
2
An open-source, high-performance tool for automated sleep staging.一个用于自动睡眠分期的开源、高性能工具。
Elife. 2021 Oct 14;10:e70092. doi: 10.7554/eLife.70092.
3
Inter-database validation of a deep learning approach for automatic sleep scoring.深度学习方法自动睡眠评分的跨数据库验证。
使用深度振荡神经网络对全脑睡眠脑电图进行建模。
Front Neuroinform. 2025 May 14;19:1513374. doi: 10.3389/fninf.2025.1513374. eCollection 2025.
4
Open-Source Algorithm for Automated Vigilance State Classification Using Single-Channel Electroencephalogram in Rodents.使用啮齿动物单通道脑电图进行自动警觉状态分类的开源算法
Sensors (Basel). 2025 Feb 3;25(3):921. doi: 10.3390/s25030921.
5
A Hierarchical Framework for Selecting Reference Measures for the Analytical Validation of Sensor-Based Digital Health Technologies.用于为基于传感器的数字健康技术的分析验证选择参考测量的分层框架。
J Med Internet Res. 2025 Feb 7;27:e58956. doi: 10.2196/58956.
6
Synergistic integration of brain networks and time-frequency multi-view feature for sleep stage classification.用于睡眠阶段分类的脑网络与时频多视图特征的协同整合。
Health Inf Sci Syst. 2025 Jan 10;13(1):15. doi: 10.1007/s13755-024-00328-0. eCollection 2025 Dec.
7
Retrospective validation of automatic sleep analysis with grey areas model for human-in-the-loop scoring approach.采用灰区模型的自动睡眠分析在人工参与评分方法中的回顾性验证
J Sleep Res. 2025 Jun;34(3):e14362. doi: 10.1111/jsr.14362. Epub 2024 Oct 23.
8
A deep learning algorithm model to automatically score and grade obstructive sleep apnea in adult polysomnography.一种用于在成人多导睡眠图中自动对阻塞性睡眠呼吸暂停进行评分和分级的深度学习算法模型。
Digit Health. 2024 Oct 17;10:20552076241291707. doi: 10.1177/20552076241291707. eCollection 2024 Jan-Dec.
9
Overnight Sleep Staging Using Chest-Worn Accelerometry.使用胸部佩戴加速度计进行夜间睡眠分期。
Sensors (Basel). 2024 Sep 2;24(17):5717. doi: 10.3390/s24175717.
10
NAPping PAnts (NAPPA): An open wearable solution for monitoring Infant's sleeping rhythms, respiration and posture.用于监测婴儿睡眠节律、呼吸和姿势的开放式可穿戴解决方案——NAPping PAnts(新生儿活动与姿势监测裤)
Heliyon. 2024 Jun 21;10(13):e33295. doi: 10.1016/j.heliyon.2024.e33295. eCollection 2024 Jul 15.
PLoS One. 2021 Aug 16;16(8):e0256111. doi: 10.1371/journal.pone.0256111. eCollection 2021.
4
Boosting automated sleep staging performance in big datasets using population subgrouping.利用人群亚组提高大数据集中自动睡眠分期的性能。
Sleep. 2021 Jul 9;44(7). doi: 10.1093/sleep/zsab027.
5
U-Sleep: resilient high-frequency sleep staging.U-Sleep:弹性高频睡眠分期
NPJ Digit Med. 2021 Apr 15;4(1):72. doi: 10.1038/s41746-021-00440-5.
6
Estimating sleep stages using cardiorespiratory signals: validation of a novel algorithm across a wide range of sleep-disordered breathing severity.使用心肺信号估计睡眠阶段:一种新算法在广泛的睡眠呼吸障碍严重程度中的验证。
J Clin Sleep Med. 2021 Jul 1;17(7):1343-1354. doi: 10.5664/jcsm.9192.
7
Interrater sleep stage scoring reliability between manual scoring from two European sleep centers and automatic scoring performed by the artificial intelligence-based Stanford-STAGES algorithm.两个欧洲睡眠中心的手动评分与基于人工智能的斯坦福-STAGES 算法的自动评分之间的睡眠分期评分者间可靠性。
J Clin Sleep Med. 2021 Jun 1;17(6):1237-1247. doi: 10.5664/jcsm.9174.
8
Automatic sleep stage classification with deep residual networks in a mixed-cohort setting.基于深度残差网络的混合队列自动睡眠分期。
Sleep. 2021 Jan 21;44(1). doi: 10.1093/sleep/zsaa161.
9
Dreem Open Datasets: Multi-Scored Sleep Datasets to Compare Human and Automated Sleep Staging.Dreem开放数据集:用于比较人类和自动睡眠分期的多评分睡眠数据集。
IEEE Trans Neural Syst Rehabil Eng. 2020 Sep;28(9):1955-1965. doi: 10.1109/TNSRE.2020.3011181. Epub 2020 Jul 22.
10
Expert-level automated sleep staging of long-term scalp electroencephalography recordings using deep learning.使用深度学习对长期头皮脑电图记录进行专家级别的自动睡眠分期。
Sleep. 2020 Nov 12;43(11). doi: 10.1093/sleep/zsaa112.