• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一致性的机会校正测量有多可靠?

How reliable are chance-corrected measures of agreement?

作者信息

Guggenmoos-Holzmann I

机构信息

Department of Biostatistics and Medical Informatics, Free University of Berlin, Germany.

出版信息

Stat Med. 1993 Dec 15;12(23):2191-205. doi: 10.1002/sim.4780122305.

DOI:10.1002/sim.4780122305
PMID:8310189
Abstract

Chance-corrected measures of agreement are prone to exhibit paradoxical and counter-intuitive results when used as measures of reliability. It is demonstrated that these problems arise with Cohen's kappa as well as with Aickin's alpha. They are the consequence of an analogy to Simpson's paradox in mixed populations. It is further shown that chance-corrected measures of agreement may yield misleading values for binary ratings. It is concluded that improvements in the design and the analysis of reliability studies are a prerequisite for valid and pertinent results.

摘要

当用作可靠性度量时,一致性的机会校正度量容易出现矛盾和违反直觉的结果。结果表明,这些问题在科恩kappa系数以及艾金alpha系数中都会出现。它们是混合总体中与辛普森悖论类似情况的结果。进一步表明,一致性的机会校正度量对于二元评级可能会产生误导性的值。得出的结论是,可靠性研究的设计和分析的改进是获得有效和相关结果的先决条件。

相似文献

1
How reliable are chance-corrected measures of agreement?一致性的机会校正测量有多可靠?
Stat Med. 1993 Dec 15;12(23):2191-205. doi: 10.1002/sim.4780122305.
2
Kappa-like indices of observer agreement viewed from a latent class perspective.从潜在类别视角看观察者一致性的类kappa指数。
Stat Med. 1998 Apr 30;17(8):797-812. doi: 10.1002/(sici)1097-0258(19980430)17:8<797::aid-sim776>3.0.co;2-g.
3
[Quality criteria of assessment scales--Cohen's kappa as measure of interrator reliability (1)].评估量表的质量标准——作为评估者信度度量的科恩kappa系数(1)
Pflege. 2004 Feb;17(1):36-46. doi: 10.1024/1012-5302.17.1.36.
4
The meaning of kappa: probabilistic concepts of reliability and validity revisited.kappa的意义:可靠性和有效性的概率概念再探讨。
J Clin Epidemiol. 1996 Jul;49(7):775-82. doi: 10.1016/0895-4356(96)00011-x.
5
The Kappa Paradox Explained.卡帕悖论解析。
J Hand Surg Am. 2024 May;49(5):482-485. doi: 10.1016/j.jhsa.2024.01.006. Epub 2024 Feb 17.
6
Interobserver agreement: Cohen's kappa coefficient does not necessarily reflect the percentage of patients with congruent classifications.观察者间一致性:科恩卡方系数不一定反映分类一致的患者百分比。
Int J Clin Pharmacol Ther. 1997 Mar;35(3):93-5.
7
Chance-corrected measures for 2 × 2 tables that coincide with weighted kappa.2×2 列联表与加权kappa 一致的校正机遇测度。
Br J Math Stat Psychol. 2011 May;64(Pt 2):355-65. doi: 10.1348/2044-8317.002001. Epub 2010 Dec 7.
8
Interpreting kappa values for two-observer nursing diagnosis data.
Res Nurs Health. 1997 Oct;20(5):465-70. doi: 10.1002/(sici)1098-240x(199710)20:5<465::aid-nur10>3.0.co;2-8.
9
[Analyzing interrater agreement for categorical data using Cohen's kappa and alternative coefficients].[使用科恩kappa系数及其他系数分析分类数据的评分者间一致性]
Rehabilitation (Stuttg). 2007 Dec;46(6):370-7. doi: 10.1055/s-2007-976535.
10
The prediction of pouch of Douglas obliteration using offline analysis of the transvaginal ultrasound 'sliding sign' technique: inter- and intra-observer reproducibility.经阴道超声“滑动征”技术的离线分析预测道格拉斯窝消失:观察者间和观察者内的可重复性。
Hum Reprod. 2013 May;28(5):1237-46. doi: 10.1093/humrep/det044. Epub 2013 Mar 12.

引用本文的文献

1
Robustness of -type coefficients for clinical agreement.- 型系数用于临床一致性的稳健性。
Stat Med. 2022 May 20;41(11):1986-2004. doi: 10.1002/sim.9341. Epub 2022 Feb 6.
2
Degenerative findings on MRI of the cervical spine: an inter- and intra-rater reliability study.颈椎MRI的退行性改变:一项评分者间和评分者内可靠性研究。
Chiropr Man Therap. 2018 Oct 16;26:43. doi: 10.1186/s12998-018-0210-2. eCollection 2018.
3
A Monte Carlo-Based Bayesian Approach for Measuring Agreement in a Qualitative Scale.一种基于蒙特卡洛方法的贝叶斯定性量表一致性测量方法。
Appl Psychol Meas. 2015 May;39(3):189-207. doi: 10.1177/0146621614554080. Epub 2014 Nov 5.
4
Accuracy and agreement of PIRADSv2 for prostate cancer mpMRI: A multireader study.前列腺癌磁共振成像(mpMRI)中PIRADSv2的准确性和一致性:一项多阅片者研究
J Magn Reson Imaging. 2017 Feb;45(2):579-585. doi: 10.1002/jmri.25372. Epub 2016 Jul 8.
5
Observer agreement paradoxes in 2x2 tables: comparison of agreement measures.2×2列联表中的观察者一致性悖论:一致性度量的比较
BMC Med Res Methodol. 2014 Aug 28;14:100. doi: 10.1186/1471-2288-14-100.
6
Automatic detection of fast oscillations (40-200 Hz) in scalp EEG recordings.头皮 EEG 记录中快速振荡(40-200 Hz)的自动检测。
Clin Neurophysiol. 2012 Apr;123(4):670-80. doi: 10.1016/j.clinph.2011.07.050. Epub 2011 Sep 21.
7
Development and preliminary psychometric properties of a well-being index for medical students.医学生幸福感指数的编制与初步心理测量学特性研究。
BMC Med Educ. 2010 Jan 27;10:8. doi: 10.1186/1472-6920-10-8.
8
Assessing agreement between multiple raters with missing rating information, applied to breast cancer tumour grading.评估存在缺失评分信息的多个评分者之间的一致性,并将其应用于乳腺癌肿瘤分级。
PLoS One. 2008 Aug 13;3(8):e2925. doi: 10.1371/journal.pone.0002925.
9
Methodological considerations concerning the development of oral dental erosion indexes: literature survey, validity and reliability.关于口腔牙侵蚀指数发展的方法学考量:文献综述、有效性与可靠性
Clin Oral Investig. 2008 Mar;12 Suppl 1(Suppl 1):S51-8. doi: 10.1007/s00784-007-0178-0. Epub 2008 Jan 29.
10
How do statistical properties influence findings of tracking (maintenance) in epidemiologic studies? An example of research in tracking of obesity.统计特性如何影响流行病学研究中的追踪(维持)结果?肥胖追踪研究的一个例子。
Eur J Epidemiol. 2003;18(11):1037-45. doi: 10.1023/a:1026196310041.