A New Interpretation of the Weighted Kappa Coefficients.

Author information

Vanbelle Sophie

Affiliation

Department of Methodology & Statistics, CAPHRI School for Public Health and Primary Care, Maastricht University, P.O. Box 616, 6200 MD, Maastricht, The Netherlands.

Publication information

Psychometrika. 2016 Jun;81(2):399-410. doi: 10.1007/s11336-014-9439-4. Epub 2014 Dec 17.

DOI: 10.1007/s11336-014-9439-4
PMID: 25516203
Abstract

Reliability and agreement studies are of paramount importance. They do contribute to the quality of studies by providing information about the amount of error inherent to any diagnosis, score or measurement. Guidelines for reporting reliability and agreement studies were recently provided. While the use of the kappa-like family is advised for categorical and ordinal scales, no further guideline in the choice of a weighting scheme is given. In the present paper, a new simple and practical interpretation of the linear- and quadratic-weighted kappa coefficients is given. This will help researchers in motivating their choice of a weighting scheme.
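
For readers unfamiliar with the two weighting schemes named in the abstract, the following is a minimal sketch of how linear- and quadratic-weighted kappa are commonly computed from a square table of paired ratings. It uses the standard textbook definitions (agreement weights 1 - |i - j|/(k - 1) and 1 - ((i - j)/(k - 1))^2) and does not reproduce the interpretation proposed in the paper. The function name `weighted_kappa` and the example counts are hypothetical and are not taken from the article.

```python
def weighted_kappa(table, scheme="linear"):
    """Weighted kappa for a k x k table of counts.

    Rows are the categories assigned by rater A, columns those assigned
    by rater B, both in the same ordinal order. `scheme` selects the
    agreement weights: "linear" or "quadratic".
    """
    k = len(table)
    if k < 2 or any(len(row) != k for row in table):
        raise ValueError("table must be square with at least 2 categories")
    power = {"linear": 1, "quadratic": 2}[scheme]

    total = sum(sum(row) for row in table)
    row_marg = [sum(row) / total for row in table]
    col_marg = [sum(table[i][j] for i in range(k)) / total for j in range(k)]

    observed = 0.0  # weighted observed agreement
    expected = 0.0  # weighted agreement expected under independence
    for i in range(k):
        for j in range(k):
            # Agreement weight: 1 on the diagonal, 0 at maximal distance.
            w = 1.0 - (abs(i - j) / (k - 1)) ** power
            observed += w * table[i][j] / total
            expected += w * row_marg[i] * col_marg[j]

    return (observed - expected) / (1.0 - expected)


# Hypothetical 3-category example (counts invented for illustration).
ratings = [
    [20, 5, 1],
    [4, 15, 6],
    [1, 5, 18],
]
print("linear:   ", weighted_kappa(ratings, "linear"))
print("quadratic:", weighted_kappa(ratings, "quadratic"))
```

With only two categories the two schemes coincide, since every disagreement is at the maximal distance; the choice of weights only matters from three categories upward.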

Similar articles

1
A New Interpretation of the Weighted Kappa Coefficients.
Psychometrika. 2016 Jun;81(2):399-410. doi: 10.1007/s11336-014-9439-4. Epub 2014 Dec 17.
2
A Note on the Linearly and Quadratically Weighted Kappa Coefficients.
Psychometrika. 2016 Sep;81(3):795-801. doi: 10.1007/s11336-016-9501-5. Epub 2016 May 31.
3
Dependence of weighted kappa coefficients on the number of categories.
Epidemiology. 1996 Mar;7(2):199-202. doi: 10.1097/00001648-199603000-00016.
4
Quality assessment of ordinal scale reproducibility: log-linear models provided useful information on scale structure.
J Clin Epidemiol. 2008 Oct;61(10):983-90. doi: 10.1016/j.jclinepi.2007.11.004. Epub 2008 May 27.
5
Asymptotic variability of (multilevel) multirater kappa coefficients.
Stat Methods Med Res. 2019 Oct-Nov;28(10-11):3012-3026. doi: 10.1177/0962280218794733. Epub 2018 Aug 22.
6
Modelling patterns of agreement for nominal scales.
Stat Med. 2008 Mar 15;27(6):810-30. doi: 10.1002/sim.2945.
7
Measures of clinical agreement for nominal and categorical data: the kappa coefficient.
Comput Biol Med. 1992 Jul;22(4):239-46. doi: 10.1016/0010-4825(92)90063-s.
8
Analysis of the Weighted Kappa and Its Maximum with Markov Moves.
Psychometrika. 2022 Dec;87(4):1270-1289. doi: 10.1007/s11336-022-09844-y. Epub 2022 Feb 3.
9
Measures of interrater agreement.
J Thorac Oncol. 2011 Jan;6(1):6-7. doi: 10.1097/JTO.0b013e318200f983.
10
The kappa statistic in rehabilitation research: an examination.
Arch Phys Med Rehabil. 2004 Aug;85(8):1371-6. doi: 10.1016/j.apmr.2003.12.002.

Cited by

1
Artificial intelligence-generated apparent diffusion coefficient (AI-ADC) maps for prostate gland assessment: a multi-reader study.
Eur Radiol. 2025 Jul 21. doi: 10.1007/s00330-025-11871-z.
2
Leveraging on large language model to classify sentences: a case study applying STAGES scoring methodology for sentence completion test on ego development.
Front Psychol. 2025 Feb 6;16:1488102. doi: 10.3389/fpsyg.2025.1488102. eCollection 2025.
3
Which is the Superior Thoracolumbar Injury Classification Tool? TLICS Versus AOSpine 2013: A Systematic Review.
Global Spine J. 2025 May;15(4):2536-2546. doi: 10.1177/21925682241311303. Epub 2024 Dec 25.
4
A comprehensive guide to study the agreement and reliability of multi-observer ordinal data.
BMC Med Res Methodol. 2024 Dec 20;24(1):310. doi: 10.1186/s12874-024-02431-y.
5
The generic version of China Health Related Outcomes Measures (CHROME-G): psychometric testing and comparative performance with the EQ-5D-5L and SF-6Dv2 among the Chinese general population.
BMC Public Health. 2024 Dec 18;24(1):3485. doi: 10.1186/s12889-024-20999-4.
6
Development of a Simple Patient-reported Outcome Measurement for Terminally Ill Cancer Patients Receiving Home-based Palliative Care.
Indian J Palliat Care. 2024 Jul-Sep;30(3):260-267. doi: 10.25259/IJPC_100_2024. Epub 2024 Aug 14.
7
Large language models facilitate the generation of electronic health record phenotyping algorithms.
J Am Med Inform Assoc. 2024 Sep 1;31(9):1994-2001. doi: 10.1093/jamia/ocae072.
8
Letter to the Editor: "Development and validation of equations for conversion from DAS28ESR and DAS28CRP to the SDAI in patients with rheumatoid arthritis".
Clin Rheumatol. 2024 May;43(5):1779-1781. doi: 10.1007/s10067-024-06924-5. Epub 2024 Apr 6.
9
Translating and establishing the psychometric properties of the Jenkins Sleep Scale for Arabic-speaking individuals.
BMC Psychiatry. 2024 Mar 28;24(1):236. doi: 10.1186/s12888-024-05714-2.
10
Characterising landcover changes and urban sprawl using geospatial techniques and landscape metrics in Bulawayo, Zimbabwe (1984-2022).
Heliyon. 2024 Mar 10;10(6):e27275. doi: 10.1016/j.heliyon.2024.e27275. eCollection 2024 Mar 30.

References

1
Guidelines for Reporting Reliability and Agreement Studies (GRRAS) were proposed.
J Clin Epidemiol. 2011 Jan;64(1):96-106. doi: 10.1016/j.jclinepi.2010.03.002. Epub 2010 Jun 17.
2
Weighted kappa: nominal scale agreement with provision for scaled disagreement or partial credit.
Psychol Bull. 1968 Oct;70(4):213-20. doi: 10.1037/h0026256.
3
Intraclass correlations: uses in assessing rater reliability.
Psychol Bull. 1979 Mar;86(2):420-8. doi: 10.1037//0033-2909.86.2.420.
4
The dependence of Cohen's kappa on the prevalence does not matter.
J Clin Epidemiol. 2005 Jul;58(7):655-61. doi: 10.1016/j.jclinepi.2004.02.021. Epub 2005 Apr 18.
5
Dependence of weighted kappa coefficients on the number of categories.
Epidemiology. 1996 Mar;7(2):199-202. doi: 10.1097/00001648-199603000-00016.
6
Bias, prevalence and kappa.
J Clin Epidemiol. 1993 May;46(5):423-9. doi: 10.1016/0895-4356(93)90018-v.
7
A proposed index for measuring agreement in test-retest studies.
J Chronic Dis. 1966 Sep;19(9):991-1006. doi: 10.1016/0021-9681(66)90032-4.
8
High agreement but low kappa: I. The problems of two paradoxes.
J Clin Epidemiol. 1990;43(6):543-9. doi: 10.1016/0895-4356(90)90158-l.
9
High agreement but low kappa: II. Resolving the paradoxes.
J Clin Epidemiol. 1990;43(6):551-8. doi: 10.1016/0895-4356(90)90159-m.
10
Methods for estimating the parameters of a linear model for ordered categorical data.
Biometrics. 1992 Mar;48(1):271-81.