• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

放射学研究中评估评分者一致性的一些常用统计方法。

Some common statistical methods for assessing rater agreement in radiological studies.

作者信息

Geijer Mats, Båth Magnus, Wessman Catrin

机构信息

Department of Radiology, Institute of Clinical Sciences, Sahlgrenska Academy, University of Gothenburg, Gothenburg, Sweden.

Department of Radiology, Region Västra Götaland, Sahlgrenska University Hospital, Gothenburg, Sweden.

出版信息

Acta Radiol. 2025 Jun;66(6):675-683. doi: 10.1177/02841851251319666. Epub 2025 Feb 23.

DOI:10.1177/02841851251319666
PMID:39988909
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12163189/
Abstract

Rater agreement is commonly assessed in radiologic studies concerning image quality. There are several methods of measuring rater agreement. To choose the appropriate method, the researcher needs to consider the scale of the outcome variable and the design of the study. This article provides a brief overview of available methods, focusing on the most practical and commonly used, including intraclass correlation, the Svensson method, variants of the kappa statistic, the agreement coefficient by Gwet (AC1/AC2), and Krippendorff's alpha. Additional methods that are not primarily intended for rater agreement analysis but are applied in some cases are also discussed.

摘要

在有关图像质量的放射学研究中,评分者间的一致性通常会得到评估。有几种测量评分者间一致性的方法。为了选择合适的方法,研究者需要考虑结果变量的尺度和研究设计。本文简要概述了可用的方法,重点介绍了最实用且常用的方法,包括组内相关系数、斯文森方法、kappa统计量的变体、格韦特一致性系数(AC1/AC2)以及克里彭多夫α系数。还讨论了一些并非主要用于评分者间一致性分析但在某些情况下会应用的其他方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a36/12163189/6fe6868c7560/10.1177_02841851251319666-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a36/12163189/6fe6868c7560/10.1177_02841851251319666-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a36/12163189/6fe6868c7560/10.1177_02841851251319666-fig1.jpg

相似文献

1
Some common statistical methods for assessing rater agreement in radiological studies.放射学研究中评估评分者一致性的一些常用统计方法。
Acta Radiol. 2025 Jun;66(6):675-683. doi: 10.1177/02841851251319666. Epub 2025 Feb 23.
2
Measuring inter-rater reliability for nominal data - which coefficients and confidence intervals are appropriate?测量名义数据的评分者间信度——哪些系数和置信区间是合适的?
BMC Med Res Methodol. 2016 Aug 5;16:93. doi: 10.1186/s12874-016-0200-9.
3
Interobserver agreement issues in radiology.放射学中的观察者间一致性问题。
Diagn Interv Imaging. 2020 Oct;101(10):639-641. doi: 10.1016/j.diii.2020.09.001. Epub 2020 Sep 18.
4
Inter-rater reliability of occupational exposure assessment in a case-control study of female breast cancer.职业暴露评估在女性乳腺癌病例对照研究中的评价者间信度。
J Occup Environ Hyg. 2021 Oct-Nov;18(10-11):522-531. doi: 10.1080/15459624.2021.1976412. Epub 2021 Oct 11.
5
Homogeneity score test of AC statistics and estimation of common AC in multiple or stratified inter-rater agreement studies.多或分层组内一致性研究中 AC 统计量的同质性检验和共同 AC 的估计。
BMC Med Res Methodol. 2020 Feb 5;20(1):20. doi: 10.1186/s12874-019-0887-5.
6
Intra and inter observer agreement in the mobility assessment of the upper thoracic costovertebral joints.在上胸肋椎关节活动度评估中的观察者内和观察者间的一致性。
Physiother Theory Pract. 2023 Sep 2;39(9):1993-1999. doi: 10.1080/09593985.2022.2058439. Epub 2022 Mar 27.
7
Grading lumbar foraminal stenosis - Interrater agreement of radiologists and radiology trainees before and after education of a standardised grading scale.腰椎侧隐窝狭窄分级-在对标准化分级量表进行教育前后,放射科医生和放射科受训者之间的分级一致性。
J Med Imaging Radiat Oncol. 2024 Aug;68(5):511-515. doi: 10.1111/1754-9485.13669. Epub 2024 May 15.
8
Reliability in evaluator-based tests: using simulation-constructed models to determine contextually relevant agreement thresholds.基于评估者的测试的可靠性:使用模拟构建的模型确定上下文相关的一致性阈值。
BMC Med Res Methodol. 2018 Nov 19;18(1):141. doi: 10.1186/s12874-018-0606-7.
9
A scale of methodological quality for clinical studies of radiologic examinations.放射学检查临床研究的方法学质量量表。
Radiology. 2000 Oct;217(1):69-74. doi: 10.1148/radiology.217.1.r00oc0669.
10
Appropriate Statistics for Determining Chance-Removed Interpractitioner Agreement.确定机会消除后从业者间一致性的适当统计方法。
J Altern Complement Med. 2019 Nov;25(11):1115-1120. doi: 10.1089/acm.2017.0297. Epub 2018 May 31.

引用本文的文献

1
Virtual Non-Contrast Reconstructions Derived from Dual-Energy CTA Scans in Peripheral Arterial Disease: Comparison with True Non-Contrast Images and Impact on Radiation Dose.基于双能量CT血管造影扫描的外周动脉疾病虚拟非增强重建:与真实非增强图像的比较及对辐射剂量的影响
J Clin Med. 2025 Aug 7;14(15):5571. doi: 10.3390/jcm14155571.

本文引用的文献

1
Gwet's AC1 is not a substitute for Cohen's kappa - A comparison of basic properties.格韦特AC1不能替代科恩kappa系数——基本特性比较
MethodsX. 2023 May 10;10:102212. doi: 10.1016/j.mex.2023.102212. eCollection 2023.
2
The Bland-Altman method should not be used when one of the two measurement methods has negligible measurement errors.当两种测量方法中的一种具有可忽略的测量误差时,不应使用 Bland-Altman 方法。
PLoS One. 2022 Dec 12;17(12):e0278915. doi: 10.1371/journal.pone.0278915. eCollection 2022.
3
EVALUATION OF VGC ANALYZER BY COMPARISON WITH GOLD STANDARD ROC SOFTWARE AND ANALYSIS OF SIMULATED VISUAL GRADING DATA.
通过与金标准 ROC 软件的比较评估 VGC 分析器及模拟视觉分级数据的分析。
Radiat Prot Dosimetry. 2021 Oct 12;195(3-4):378-390. doi: 10.1093/rpd/ncab066.
4
Pre- and postoperative offset and femoral neck version measurements and validation using 3D computed tomography in total hip arthroplasty.全髋关节置换术中使用三维计算机断层扫描进行术前和术后偏移及股骨颈扭转测量与验证
Acta Radiol Open. 2020 Oct 8;9(10):2058460120964911. doi: 10.1177/2058460120964911. eCollection 2020 Oct.
5
Intraclass correlation - A discussion and demonstration of basic features.组内相关系数 - 基本特征的讨论与演示。
PLoS One. 2019 Jul 22;14(7):e0219854. doi: 10.1371/journal.pone.0219854. eCollection 2019.
6
Hubert's multi-rater kappa revisited.再探休伯尔氏多评估者 κ 系数。
Br J Math Stat Psychol. 2020 Feb;73(1):1-22. doi: 10.1111/bmsp.12167. Epub 2019 May 6.
7
CT Detectability of Small Low-Contrast Hypoattenuating Focal Lesions: Iterative Reconstructions versus Filtered Back Projection.CT 检测小低对比率局灶性低衰减病灶的能力:迭代重建与滤波反投影的比较。
Radiology. 2018 Nov;289(2):443-454. doi: 10.1148/radiol.2018180137. Epub 2018 Jul 17.
8
Summary measures of agreement and association between many raters' ordinal classifications.多位评估者的有序分类之间一致性和关联性的汇总指标。
Ann Epidemiol. 2017 Oct;27(10):677-685.e4. doi: 10.1016/j.annepidem.2017.09.001. Epub 2017 Sep 22.
9
Measuring inter-rater reliability for nominal data - which coefficients and confidence intervals are appropriate?测量名义数据的评分者间信度——哪些系数和置信区间是合适的?
BMC Med Res Methodol. 2016 Aug 5;16:93. doi: 10.1186/s12874-016-0200-9.
10
A Guideline of Selecting and Reporting Intraclass Correlation Coefficients for Reliability Research.可靠性研究中组内相关系数选择与报告指南
J Chiropr Med. 2016 Jun;15(2):155-63. doi: 10.1016/j.jcm.2016.02.012. Epub 2016 Mar 31.