

A comprehensive guide to study the agreement and reliability of multi-observer ordinal data.

Authors

Vanbelle Sophie, Engelhart Christina Hernandez, Blix Ellen

Affiliations

Methodology and Statistics, CAPHRI, Maastricht University, P. Debyeplein, 1, Maastricht, 6229 HA, The Netherlands.

Norwegian Research Center for Women's Health, Oslo University Hospital, P.O box 4950 Nydalen, Oslo, N-0424, Norway.

Publication information

BMC Med Res Methodol. 2024 Dec 20;24(1):310. doi: 10.1186/s12874-024-02431-y.

DOI: 10.1186/s12874-024-02431-y
PMID: 39707223
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11660713/

Abstract

BACKGROUND

A recent systematic review revealed problems in how agreement and reliability studies for ordinal scales are performed and reported, especially when more than two observers are involved. This paper therefore aims to provide the information needed to choose among the most meaningful and most widely used measures and to plan agreement and reliability studies for ordinal outcomes.

METHODS

This paper considers the generalisation of the proportion of (dis)agreement, the mean absolute deviation, the mean squared deviation and weighted kappa coefficients to more than two observers for an ordinal outcome.
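As an illustration of the first three measures, the sketch below averages each quantity over all pairs of observers within a subject and then over subjects. This pairwise averaging is one common way to extend two-observer measures to several observers; it is an assumption made here for illustration and is not necessarily the exact definition used in the paper or its R package. The function name `pairwise_agreement_measures` and the toy ratings are hypothetical.

```python
from itertools import combinations


def pairwise_agreement_measures(ratings):
    """Average pairwise agreement measures over subjects.

    ratings: list of rows, one per subject; each row holds the ordinal
    category (coded as an integer) assigned by every observer.
    Returns the mean proportion of agreement, mean absolute deviation
    and mean squared deviation across observer pairs.
    """
    totals = {"agreement": 0.0, "mad": 0.0, "msd": 0.0}
    for row in ratings:
        pairs = list(combinations(row, 2))  # all observer pairs for this subject
        totals["agreement"] += sum(a == b for a, b in pairs) / len(pairs)
        totals["mad"] += sum(abs(a - b) for a, b in pairs) / len(pairs)
        totals["msd"] += sum((a - b) ** 2 for a, b in pairs) / len(pairs)
    n_subjects = len(ratings)
    return {name: total / n_subjects for name, total in totals.items()}


# Hypothetical data: 4 subjects rated by 3 observers on a 3-point ordinal scale.
ratings = [
    [1, 1, 1],
    [1, 2, 2],
    [3, 3, 2],
    [2, 2, 2],
]
result = pairwise_agreement_measures(ratings)
print(result)
```

Treating the category codes as equally spaced integers is itself a modelling choice; the absolute and squared deviations change if the categories are scored differently.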

RESULTS

After highlighting the difference between the concepts of agreement and reliability, a clear and simple interpretation of the agreement and reliability coefficients is provided. The large-sample variance of each coefficient, obtained with the delta method, is presented, or derived when not available in the literature, so that Wald confidence intervals can be constructed. Finally, a procedure is provided to determine the minimum number of raters and patients needed to limit the uncertainty associated with the sampling process. All the methods are available in an R package and a Shiny application to circumvent the limitations of current software.
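To make the idea of a Wald confidence interval concrete, the sketch below treats the observers as fixed and the subjects as an independent sample. Under that assumption the overall proportion of agreement is a plain mean of per-subject scores, so its large-sample standard error is the sample standard deviation divided by the square root of the number of subjects. This is a simplified stand-in, not the delta-method variances derived in the paper; the function name `wald_ci_agreement` and the data are hypothetical.

```python
import math
from itertools import combinations
from statistics import mean, stdev


def wald_ci_agreement(ratings, z=1.96):
    """Wald interval for the mean pairwise proportion of agreement.

    ratings: list of rows, one per subject, each holding the ordinal
    category given by every observer (observers treated as fixed).
    z: standard normal quantile (1.96 for an approximate 95% interval).
    """
    scores = []
    for row in ratings:
        pairs = list(combinations(row, 2))
        scores.append(sum(a == b for a, b in pairs) / len(pairs))
    estimate = mean(scores)
    se = stdev(scores) / math.sqrt(len(scores))  # standard error of a mean
    lower = max(0.0, estimate - z * se)  # clip to the [0, 1] range
    upper = min(1.0, estimate + z * se)
    return estimate, se, (lower, upper)


# Hypothetical data: 4 subjects rated by 3 observers.
ratings = [
    [1, 1, 1],
    [1, 2, 2],
    [3, 3, 2],
    [2, 2, 2],
]
estimate, se, (lower, upper) = wald_ci_agreement(ratings)
print(f"agreement = {estimate:.3f}, SE = {se:.3f}, CI = ({lower:.3f}, {upper:.3f})")
```

With only four subjects the interval is very wide and the normal approximation is poor, which is the kind of uncertainty a sample-size procedure for raters and patients is meant to control.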

CONCLUSIONS

The present paper complements existing guidelines, such as the Guidelines for Reporting Reliability and Agreement Studies (GRRAS), to improve the quality of reliability and agreement studies of clinical tests. Furthermore, we provide open source software for researchers with minimal programming skills.


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2f67/11660713/40e61449d10b/12874_2024_2431_Fig1_HTML.jpg

Similar articles

1
A comprehensive guide to study the agreement and reliability of multi-observer ordinal data.
BMC Med Res Methodol. 2024 Dec 20;24(1):310. doi: 10.1186/s12874-024-02431-y.
2
A New Interpretation of the Weighted Kappa Coefficients.
Psychometrika. 2016 Jun;81(2):399-410. doi: 10.1007/s11336-014-9439-4. Epub 2014 Dec 17.
3
Guidelines for Reporting Reliability and Agreement Studies (GRRAS) were proposed.
J Clin Epidemiol. 2011 Jan;64(1):96-106. doi: 10.1016/j.jclinepi.2010.03.002. Epub 2010 Jun 17.
4
Measures of clinical agreement for nominal and categorical data: the kappa coefficient.
Comput Biol Med. 1992 Jul;22(4):239-46. doi: 10.1016/0010-4825(92)90063-s.
5
Interrater agreement and interrater reliability: key concepts, approaches, and applications.
Res Social Adm Pharm. 2013 May-Jun;9(3):330-8. doi: 10.1016/j.sapharm.2012.04.004. Epub 2012 Jun 12.
6
The kappa statistic in rehabilitation research: an examination.
Arch Phys Med Rehabil. 2004 Aug;85(8):1371-6. doi: 10.1016/j.apmr.2003.12.002.
7
Agreement Analysis: What He Said, She Said Versus You Said.
Anesth Analg. 2018 Jun;126(6):2123-2128. doi: 10.1213/ANE.0000000000002924.
8
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
9
Reliability analysis for continuous measurements: equivalence test for agreement.
Stat Med. 2008 Jul 10;27(15):2816-25. doi: 10.1002/sim.3110.
10
Assessing the reliability of ordered categorical scales using kappa-type statistics.
Stat Methods Med Res. 2005 Oct;14(5):493-514. doi: 10.1191/0962280205sm413oa.

References cited in this article

1
Measures of Agreement with Multiple Raters: Fréchet Variances and Inference.
Psychometrika. 2024 Jun;89(2):517-541. doi: 10.1007/s11336-023-09945-2. Epub 2024 Jan 8.
2
Reliability and agreement in intrapartum fetal heart rate monitoring interpretation: A systematic review.
Acta Obstet Gynecol Scand. 2023 Aug;102(8):970-985. doi: 10.1111/aogs.14591. Epub 2023 Jun 13.
3
Measuring Agreement Using Guessing Models and Knowledge Coefficients.
Psychometrika. 2023 Sep;88(3):1002-1025. doi: 10.1007/s11336-023-09919-4. Epub 2023 Jun 8.
4
How Replicates Can Inform Potential Users of a Measurement Procedure about Measurement Error: Basic Concepts and Methods.
Diagnostics (Basel). 2021 Jan 22;11(2):162. doi: 10.3390/diagnostics11020162.
5
Modeling agreement on bounded scales.
Stat Methods Med Res. 2018 Nov;27(11):3460-3477. doi: 10.1177/0962280217705709. Epub 2017 May 8.
6
FIGO consensus guidelines on intrapartum fetal monitoring: Cardiotocography.
Int J Gynaecol Obstet. 2015 Oct;131(1):13-24. doi: 10.1016/j.ijgo.2015.06.020.
7
A New Interpretation of the Weighted Kappa Coefficients.
Psychometrika. 2016 Jun;81(2):399-410. doi: 10.1007/s11336-014-9439-4. Epub 2014 Dec 17.
8
The agreement chart.
BMC Med Res Methodol. 2013 Jul 29;13:97. doi: 10.1186/1471-2288-13-97.
9
Guidelines for Reporting Reliability and Agreement Studies (GRRAS) were proposed.
J Clin Epidemiol. 2011 Jan;64(1):96-106. doi: 10.1016/j.jclinepi.2010.03.002. Epub 2010 Jun 17.
10
Weighted kappa: nominal scale agreement with provision for scaled disagreement or partial credit.
Psychol Bull. 1968 Oct;70(4):213-20. doi: 10.1037/h0026256.