Interrater agreement between American and Chinese sleep centers according to the 2014 AASM standard.

Affiliations

School of Biological Science and Medical Engineering and Research Institute, Beihang University, Shenzhen, China.

Key Laboratory of Biomechanics and Mechanobiology of Ministry of Education, Beihang University, Beijing, 100191, China.

Publication information

Sleep Breath. 2019 Jun;23(2):719-728. doi: 10.1007/s11325-019-01801-x. Epub 2019 Feb 19.

DOI:10.1007/s11325-019-01801-x

PMID:30783913

Abstract

OBJECTIVES

To determine the inter-lab reliability of sleep stage scoring under the 2014 American Academy of Sleep Medicine (AASM) manual, to examine the reasons for disagreement in depth, and to provide suggestions for improvement.

METHODS

This study used 40 all-night polysomnograms (PSGs) from different samples, segmented into 37,642 30-s epochs. Five doctors from China and two doctors from America scored the epochs following the 2014 AASM standard. Scoring agreement between the two centers was evaluated using Cohen's kappa (κ). PSG epochs with deviating scorings were then inspected visually, and the potential reasons for disagreement were analyzed.
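As an illustration of the agreement measure named above, the following is a minimal sketch of Cohen's kappa for two epoch-by-epoch hypnograms. It is not the authors' code: the function name `cohens_kappa`, the stage labels, and the toy data are assumptions made for the example, and the abstract does not say how the scorings within each center were combined before comparison.

```python
from collections import Counter


def cohens_kappa(labels_a, labels_b):
    """Cohen's kappa for two equal-length sequences of categorical labels."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("need two non-empty label sequences of equal length")
    n = len(labels_a)
    # Observed agreement: fraction of epochs given the same stage by both.
    p_o = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Chance agreement, from each scorer's marginal stage frequencies.
    count_a, count_b = Counter(labels_a), Counter(labels_b)
    p_e = sum(count_a[s] * count_b[s] for s in count_a) / (n * n)
    if p_e == 1.0:
        return 1.0  # both scorers used one identical label throughout
    return (p_o - p_e) / (1 - p_e)


# Toy hypnograms (made-up data, one label per 30-s epoch).
center_a = ["W", "W", "N1", "N2", "N2", "N3", "N3", "R", "R", "W"]
center_b = ["W", "N1", "N1", "N2", "N3", "N3", "N3", "R", "N1", "W"]
kappa = cohens_kappa(center_a, center_b)
print(f"kappa = {kappa:.2f}")
```

On this toy data the two sequences agree on 7 of 10 epochs, and kappa comes out at about 0.63 once chance agreement is discounted. The number only illustrates the formula and has no relation to the study's results.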

RESULTS

Inter-lab agreement was substantial (κ = 0.75 ± 0.01). Scoring of stage W (κ = 0.89) and stage R (κ = 0.87) achieved the highest agreement, while stage N1 (κ = 0.45) showed the lowest. In terms of the relative disagreement ratio, N2-N3 (22.09%), W-N1 (19.68%), and N1-N2 (18.75%) were the most frequent discrepancy combinations. American and Chinese doctors each showed characteristic scoring tendencies in the W-N1, N1-N2, and N2-N3 combinations. Seven reasons for disagreement were identified: "on-threshold characteristic" (29.21%), "context influence" (18.06%), "characteristic identification difficulty" (8.81%), "arousal-wake confusion" (7.57%), "derivation inconsistence" (2.15%), "on-borderline characteristic" (0.92%), and "misrecognition" (33.27%).
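The per-stage kappa values and the discrepancy combinations reported above could be computed along the following lines. This is a sketch under stated assumptions, not the study's procedure: per-stage kappa is taken to be a one-vs-rest comparison, and the "relative disagreement ratio" is assumed to be each unordered stage pair's share of all disagreeing epochs, since the abstract defines neither. The function names and toy data are hypothetical.

```python
from collections import Counter


def binary_kappa(labels_a, labels_b, stage):
    """Cohen's kappa after collapsing labels to `stage` vs. everything else."""
    n = len(labels_a)
    a = [x == stage for x in labels_a]
    b = [x == stage for x in labels_b]
    p_o = sum(x == y for x, y in zip(a, b)) / n
    pa, pb = sum(a) / n, sum(b) / n
    # Chance agreement for the two-category (stage / not stage) case.
    p_e = pa * pb + (1 - pa) * (1 - pb)
    return (p_o - p_e) / (1 - p_e) if p_e < 1 else 1.0


def disagreement_shares(labels_a, labels_b):
    """Assumed definition: share of disagreeing epochs per unordered stage pair."""
    pairs = Counter(
        tuple(sorted((x, y))) for x, y in zip(labels_a, labels_b) if x != y
    )
    total = sum(pairs.values())
    return {pair: count / total for pair, count in pairs.items()} if total else {}


# Toy hypnograms (made-up data, one label per 30-s epoch).
center_a = ["W", "W", "N1", "N2", "N2", "N3", "N3", "R", "R", "W"]
center_b = ["W", "N1", "N1", "N2", "N3", "N3", "N3", "R", "R", "W"]

kappa_w = binary_kappa(center_a, center_b, "W")
kappa_r = binary_kappa(center_a, center_b, "R")
shares = disagreement_shares(center_a, center_b)
for pair, share in sorted(shares.items()):
    print(f"{pair[0]}-{pair[1]}: {share:.0%}")
```

In the toy data the two scorers differ on one W-N1 epoch and one N2-N3 epoch, so each pair accounts for half of the disagreements. Stage R is scored identically by both, so its one-vs-rest kappa is 1.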

CONCLUSIONS

This study assessed sleep stage scoring agreement under the 2014 AASM manual and explored potential sources of labeling ambiguity. Improvement measures are suggested accordingly to help remove ambiguity for scorers and improve scoring reliability at the international level.
