• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

kappa的意义:可靠性和有效性的概率概念再探讨。

The meaning of kappa: probabilistic concepts of reliability and validity revisited.

作者信息

Guggenmoos-Holzmann I

机构信息

Institute of Medical Statistics and Information Science, Freie Universität Berlin, Germany.

出版信息

J Clin Epidemiol. 1996 Jul;49(7):775-82. doi: 10.1016/0895-4356(96)00011-x.

DOI:10.1016/0895-4356(96)00011-x
PMID:8691228
Abstract

A framework--the "agreement concept"--is developed to study the use of Cohen's kappa as well as alternative measures of chance-corrected agreement in a unified manner. Focusing on intrarater consistency it is demonstrated that for 2 x 2 tables an adequate choice between different measures of chance-corrected agreement can be made only if the characteristics of the observational setting are taken into account. In particular, a naive use of Cohen's kappa may lead to strikingly overoptimistic estimates of chance-corrected agreement. Such bias can be overcome by more elaborate study designs that allow for an unrestricted estimation of the probabilities at issue. When Cohen's kappa is appropriately applied as a measure of chance-corrected agreement, its values prove to be a linear--and not a parabolic--function of true prevalence. It is further shown how the validity of ratings is influenced by lack of consistency. Depending on the design of a validity study, this may lead, on purely formal grounds, to prevalence-dependent estimates of sensitivity and specificity. Proposed formulas for "chance-corrected" validity indexes fail to adjust for this phenomenon.

摘要

本文构建了一个框架——“一致性概念”,以便以统一的方式研究科恩kappa系数以及其他机会校正一致性的替代指标。聚焦于评分者内一致性,研究表明,对于2×2列联表,只有考虑到观察环境的特征,才能在不同的机会校正一致性指标之间做出恰当选择。特别是,单纯使用科恩kappa系数可能会导致对机会校正一致性的估计明显过于乐观。这种偏差可以通过更精细的研究设计来克服,这些设计允许对相关概率进行无限制估计。当科恩kappa系数作为机会校正一致性的指标被恰当地应用时,其值被证明是真实患病率的线性函数,而非抛物线函数。进一步表明了评分的有效性是如何受到缺乏一致性的影响。根据效度研究的设计,纯粹基于形式上的原因,这可能导致对敏感性和特异性的患病率依赖性估计。“机会校正”效度指标的建议公式未能针对这一现象进行调整。

相似文献

1
The meaning of kappa: probabilistic concepts of reliability and validity revisited.kappa的意义:可靠性和有效性的概率概念再探讨。
J Clin Epidemiol. 1996 Jul;49(7):775-82. doi: 10.1016/0895-4356(96)00011-x.
2
Kappa-like indices of observer agreement viewed from a latent class perspective.从潜在类别视角看观察者一致性的类kappa指数。
Stat Med. 1998 Apr 30;17(8):797-812. doi: 10.1002/(sici)1097-0258(19980430)17:8<797::aid-sim776>3.0.co;2-g.
3
How reliable are chance-corrected measures of agreement?一致性的机会校正测量有多可靠?
Stat Med. 1993 Dec 15;12(23):2191-205. doi: 10.1002/sim.4780122305.
4
Chance-corrected measures for 2 × 2 tables that coincide with weighted kappa.2×2 列联表与加权kappa 一致的校正机遇测度。
Br J Math Stat Psychol. 2011 May;64(Pt 2):355-65. doi: 10.1348/2044-8317.002001. Epub 2010 Dec 7.
5
Chance-corrected measures of the validity of a binary diagnostic test.二元诊断试验有效性的机遇校正测量指标。
J Clin Epidemiol. 1994 Jun;47(6):627-33. doi: 10.1016/0895-4356(94)90210-0.
6
[Quality criteria of assessment scales--Cohen's kappa as measure of interrator reliability (1)].评估量表的质量标准——作为评估者信度度量的科恩kappa系数(1)
Pflege. 2004 Feb;17(1):36-46. doi: 10.1024/1012-5302.17.1.36.
7
Clinicians are right not to like Cohen's κ.临床医生不喜欢 Cohen's κ 是对的。
BMJ. 2013 Apr 12;346:f2125. doi: 10.1136/bmj.f2125.
8
A paired kappa to compare binary ratings across two medical tests.比较两种医学检验结果的配对 Kappa 检验。
Stat Med. 2019 Jul 30;38(17):3272-3287. doi: 10.1002/sim.8200. Epub 2019 May 17.
9
Quantifying Interrater Agreement and Reliability Between Thoracic Pathologists: Paradoxical Behavior of Cohen's Kappa in the Presence of a High Prevalence of the Histopathologic Feature in Lung Cancer.量化胸科病理学家之间的评分者间一致性和可靠性:肺癌组织病理学特征高患病率情况下科恩kappa系数的矛盾行为
JTO Clin Res Rep. 2023 Dec 16;5(1):100618. doi: 10.1016/j.jtocrr.2023.100618. eCollection 2024 Jan.
10
Sensitivity and specificity-like measures of the validity of a diagnostic test that are corrected for chance agreement.对诊断试验有效性进行校正以消除偶然一致影响的类似灵敏度和特异度的指标。
Epidemiology. 1992 Mar;3(2):178-81. doi: 10.1097/00001648-199203000-00017.

引用本文的文献

1
Reliability Analysis of Psychological Concept Extraction and Classification in User-penned Text.用户撰写文本中心理概念提取与分类的可靠性分析
Proc Int AAAI Conf Weblogs Soc Media. 2024 May 31;18:422-434. doi: 10.1609/icwsm.v18i1.31324. Epub 2024 May 28.
2
Geographical origin discrimination of Chenpi using machine learning and enhanced mid-level data fusion.基于机器学习和增强型中级数据融合的陈皮产地鉴别
NPJ Sci Food. 2025 Feb 5;9(1):17. doi: 10.1038/s41538-025-00376-0.
3
New procalcitonin point-of-care test meets analytical performances to stratification of infectious syndrome.
新型即时检测降钙素原在感染综合征分层方面达到了分析性能要求。
Pract Lab Med. 2024 Feb 23;39:e00372. doi: 10.1016/j.plabm.2024.e00372. eCollection 2024 Mar.
4
Simulating and estimating agreement in the presence of multiple raters and covariates.模拟和估计存在多个评分者和协变量时的一致性。
Stat Med. 2023 May 20;42(11):1687-1698. doi: 10.1002/sim.9694. Epub 2023 Mar 5.
5
Trained health extension workers correctly identify high blood pressure in rural districts of northwest Ethiopia: a diagnostic accuracy study.经过培训的卫生推广员能在埃塞俄比亚西北部农村地区正确识别高血压:一项诊断准确性研究。
BMC Health Serv Res. 2022 Mar 22;22(1):375. doi: 10.1186/s12913-022-07794-w.
6
Lung Auscultation Using the Smartphone-Feasibility Study in Real-World Clinical Practice.使用智能手机进行肺部听诊的现实临床可行性研究。
Sensors (Basel). 2021 Jul 20;21(14):4931. doi: 10.3390/s21144931.
7
Toward standardizing the clinical testing protocols of point-of-care devices for obstructive sleep apnea diagnosis.迈向阻塞性睡眠呼吸暂停诊断即时检测设备临床测试方案的标准化。
Sleep Breath. 2021 Jun;25(2):737-748. doi: 10.1007/s11325-020-02171-5. Epub 2020 Aug 31.
8
Bioanalytical Performance of a New Particle-Enhanced Method for Measuring Procalcitonin.一种用于测量降钙素原的新型颗粒增强方法的生物分析性能
Diagnostics (Basel). 2020 Jul 7;10(7):461. doi: 10.3390/diagnostics10070461.
9
Analytical performances of a novel point-of-care procalcitonin assay.一种新型即时检测降钙素原检测方法的分析性能
Pract Lab Med. 2019 Oct 26;18:e00145. doi: 10.1016/j.plabm.2019.e00145. eCollection 2020 Jan.
10
Parental Opinions and Attitudes about Children's Vaccination Safety in Silesian Voivodeship, Poland.波兰西里西亚省家长对儿童疫苗接种安全性的看法和态度。
Int J Environ Res Public Health. 2018 Apr 15;15(4):756. doi: 10.3390/ijerph15040756.