• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

临床医生不喜欢 Cohen's κ 是对的。

Clinicians are right not to like Cohen's κ.

机构信息

Department of Epidemiology and Biostatistics, EMGO Institute for Health and Care Research, VU University Medical Center, Amsterdam, Netherlands.

出版信息

BMJ. 2013 Apr 12;346:f2125. doi: 10.1136/bmj.f2125.

DOI:10.1136/bmj.f2125
PMID:23585065
Abstract

Clinicians are interested in observer variation in terms of the probability of other raters (interobserver) or themselves (intraobserver) obtaining the same answer. Cohen's κ is commonly used in the medical literature to express such agreement in categorical outcomes. The value of Cohen's κ, however, is not sufficiently informative because it is a relative measure, while the clinician's question of observer variation calls for an absolute measure. Using an example in which the observed agreement and κ lead to different conclusions, we illustrate that percentage agreement is an absolute measure (a measure of agreement) and that κ is a relative measure (a measure of reliability). For the data to be useful for clinicians, measures of agreement should be used. The proportion of specific agreement, expressing the agreement separately for the positive and the negative ratings, is the most appropriate measure for conveying the relevant information in a 2 × 2 table and is most informative for clinicians.

摘要

临床医生关注观察者之间的差异,即其他评估者(观察者间)或他们自己(观察者内)获得相同答案的可能性。Cohen's κ 常用于医学文献中,以表示分类结果的一致性。然而,Cohen's κ 的值信息量不足,因为它是一个相对度量,而临床医生对观察者差异的问题需要一个绝对的度量。我们通过一个例子来说明,在这个例子中,观察到的一致性和 κ 导致了不同的结论,我们说明百分比一致性是一个绝对度量(一致性的度量),而 κ 是一个相对度量(可靠性的度量)。为了使数据对临床医生有用,应该使用一致性度量。具体一致性的比例,分别表示阳性和阴性评分的一致性,是在 2×2 表中传达相关信息的最合适度量,对临床医生最具信息量。

相似文献

1
Clinicians are right not to like Cohen's κ.临床医生不喜欢 Cohen's κ 是对的。
BMJ. 2013 Apr 12;346:f2125. doi: 10.1136/bmj.f2125.
2
Interobserver agreement: Cohen's kappa coefficient does not necessarily reflect the percentage of patients with congruent classifications.观察者间一致性:科恩卡方系数不一定反映分类一致的患者百分比。
Int J Clin Pharmacol Ther. 1997 Mar;35(3):93-5.
3
[Analyzing interrater agreement for categorical data using Cohen's kappa and alternative coefficients].[使用科恩kappa系数及其他系数分析分类数据的评分者间一致性]
Rehabilitation (Stuttg). 2007 Dec;46(6):370-7. doi: 10.1055/s-2007-976535.
4
[Quality criteria of assessment scales--Cohen's kappa as measure of interrator reliability (1)].评估量表的质量标准——作为评估者信度度量的科恩kappa系数(1)
Pflege. 2004 Feb;17(1):36-46. doi: 10.1024/1012-5302.17.1.36.
5
Pitfalls in the use of kappa when interpreting agreement between multiple raters in reliability studies.在可靠性研究中解释多个评分者之间的一致性时使用卡帕值的陷阱。
Physiotherapy. 2014 Mar;100(1):27-35. doi: 10.1016/j.physio.2013.08.002. Epub 2013 Nov 18.
6
Reproducibility of the implant crown aesthetic index--rating aesthetics of single-implant crowns and adjacent soft tissues with regard to observer dental specialization.种植体冠美学指数的可重复性——关于观察者牙科专业对单颗种植体冠及相邻软组织美学的评级
Clin Implant Dent Relat Res. 2009 Sep;11(3):201-13. doi: 10.1111/j.1708-8208.2008.00107.x. Epub 2008 Jul 23.
7
Weighted specific-category kappa measure of interobserver agreement.观察者间一致性的加权特定类别kappa测量
Psychol Rep. 2003 Dec;93(3 Pt 2):1283-90. doi: 10.2466/pr0.2003.93.3f.1283.
8
Chance-corrected measures for 2 × 2 tables that coincide with weighted kappa.2×2 列联表与加权kappa 一致的校正机遇测度。
Br J Math Stat Psychol. 2011 May;64(Pt 2):355-65. doi: 10.1348/2044-8317.002001. Epub 2010 Dec 7.
9
Kappa-like indices of observer agreement viewed from a latent class perspective.从潜在类别视角看观察者一致性的类kappa指数。
Stat Med. 1998 Apr 30;17(8):797-812. doi: 10.1002/(sici)1097-0258(19980430)17:8<797::aid-sim776>3.0.co;2-g.
10
Interrater and intrarater agreement of the chicago classification of achalasia subtypes using high-resolution esophageal manometry.采用高分辨率食管测压法对贲门失弛缓症亚型的芝加哥分类进行观察者间和观察者内一致性评估。
Am J Gastroenterol. 2012 Feb;107(2):207-14. doi: 10.1038/ajg.2011.353. Epub 2011 Oct 18.

引用本文的文献

1
Agreement of Specific Lung Sounds Auscultation by Veterinarians for the Detection of Bronchopneumonia in Calves.兽医听诊特定肺部声音以检测犊牛支气管肺炎的一致性
J Vet Intern Med. 2025 Sep-Oct;39(5):e70203. doi: 10.1111/jvim.70203.
2
Automating Data Entry from Electronic Health Record to Electronic Data Capture Using a Trusted Cloud-Based Application in Multisite Cancer Clinical Trials.在多中心癌症临床试验中,使用基于云的可信应用程序实现从电子健康记录到电子数据采集的数据录入自动化。
J Soc Clin Data Manag. 2025 Winter;5(1):1-16. doi: 10.47912/jscdm.371. Epub 2025 Jan 14.
3
Inter- and Intrarater Agreement of the AO Spine-DGOU Osteoporotic Fracture Classification System Using Radiography and Computed Tomography Imaging.
使用X线摄影和计算机断层扫描成像的AO脊柱-DGOU骨质疏松性骨折分类系统的评分者间和评分者内一致性
Global Spine J. 2025 Feb 7:21925682251318654. doi: 10.1177/21925682251318654.
4
Development and Feasibility Study of a Triage Tool for Early Referral to Spinal Cord Stimulation for Patients With Chronic Low Back and Leg Pain.慢性腰腿痛患者脊髓刺激早期转诊分诊工具的开发与可行性研究
Eur J Pain. 2025 Feb;29(2):e4780. doi: 10.1002/ejp.4780.
5
Reliability of the McKenzie Method of Mechanical Diagnosis and Therapy in the examination of spinal pain, including the OTHER classifications: Reliability of the McKenzie Method in spinal pain.麦肯齐机械诊断与治疗方法在脊柱疼痛检查中的可靠性,包括其他分类:麦肯齐方法在脊柱疼痛中的可靠性。
Braz J Phys Ther. 2025 Jan-Feb;29(1):101154. doi: 10.1016/j.bjpt.2024.101154. Epub 2024 Dec 13.
6
Inter-rater agreement for detection of potentially inappropriate medication according to explicit and implicit STOPP criteria.根据明确和隐含的STOPP标准检测潜在不适当用药的评估者间一致性。
Br J Clin Pharmacol. 2025 Feb;91(2):485-490. doi: 10.1111/bcp.16352. Epub 2024 Dec 2.
7
The Potential of Percent Agreement as an Adjunctive Diagnostic Tool for Acute Temporomandibular Disorder.一致性百分比作为急性颞下颌关节紊乱辅助诊断工具的潜力
J Clin Med. 2024 Sep 10;13(18):5360. doi: 10.3390/jcm13185360.
8
Validity and reliability of the Pain Assessment in Impaired Cognition 15 (PAIC15) observation scale in persons with aphasia.认知障碍 15 项疼痛评估观察量表(PAIC15)在失语症患者中的有效性和可靠性。
BMC Neurol. 2024 Sep 5;24(1):319. doi: 10.1186/s12883-024-03824-8.
9
Trajectories of clinical characteristics, complications and treatment choices in data-driven subgroups of type 2 diabetes.基于数据驱动的 2 型糖尿病亚组的临床特征、并发症和治疗选择的轨迹。
Diabetologia. 2024 Jul;67(7):1343-1355. doi: 10.1007/s00125-024-06147-y. Epub 2024 Apr 16.
10
Measurement and documentation of quality indicators for the end-of-life care of hospital patients a nationwide retrospective record review study.测量和记录医院患者临终关怀质量指标的全国回顾性病历审查研究。
BMC Palliat Care. 2023 Nov 8;22(1):174. doi: 10.1186/s12904-023-01299-x.