The disagreeable behaviour of the kappa statistic.

Author information

Flight Laura, Julious Steven A

Affiliation

Medical Statistics Group, University of Sheffield, Sheffield, England.

Publication information

Pharm Stat. 2015 Jan-Feb;14(1):74-8. doi: 10.1002/pst.1659. Epub 2014 Dec 3.

DOI:10.1002/pst.1659

PMID:25470361

Abstract

It is often of interest to measure the agreement between a number of raters when an outcome is nominal or ordinal. The kappa statistic is used as a measure of agreement. The statistic is highly sensitive to the distribution of the marginal totals and can produce unreliable results. Other statistics, such as the proportion of concordance, the maximum attainable kappa, and the prevalence- and bias-adjusted kappa, should be considered to indicate how well the kappa statistic represents agreement in the data. Each kappa should be considered and interpreted based on the context of the data being analysed.
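The sensitivity to marginal totals described in the abstract can be illustrated with a minimal sketch. It is not taken from the paper: it assumes the two-rater (Cohen) form of kappa and the commonly used textbook definitions of the companion statistics, and the function name `agreement_summary` and both example tables are hypothetical.

```python
def agreement_summary(table):
    """Summarise agreement for a square two-rater contingency table.

    table[i][j] = number of subjects placed in category i by rater A
    and in category j by rater B (hypothetical layout for illustration).
    Assumes chance agreement p_e < 1; otherwise kappa is undefined.
    """
    k = len(table)
    n = sum(sum(row) for row in table)

    # Marginal proportions for each rater.
    row_prop = [sum(row) / n for row in table]
    col_prop = [sum(table[i][j] for i in range(k)) / n for j in range(k)]

    # Proportion of concordance (observed agreement).
    p_o = sum(table[i][i] for i in range(k)) / n
    # Agreement expected by chance, given the marginals.
    p_e = sum(r * c for r, c in zip(row_prop, col_prop))
    # Largest observed agreement the marginals allow.
    p_max = sum(min(r, c) for r, c in zip(row_prop, col_prop))

    return {
        "concordance": p_o,
        "kappa": (p_o - p_e) / (1 - p_e),
        "kappa_max": (p_max - p_e) / (1 - p_e),
        # Prevalence- and bias-adjusted kappa; equals 2 * p_o - 1 when k = 2.
        "pabak": (k * p_o - 1) / (k - 1),
    }


# Two invented tables with the same 90% concordance but different marginals.
balanced = agreement_summary([[45, 5], [5, 45]])
skewed = agreement_summary([[90, 5], [5, 0]])

for name, result in (("balanced", balanced), ("skewed", skewed)):
    print(name, {key: round(value, 3) for key, value in result.items()})
```

Both tables have the same proportion of concordance (0.90) and the same prevalence- and bias-adjusted kappa (0.80). Kappa itself is 0.80 for the balanced table but slightly negative (about -0.05) for the skewed one. This is the kind of behaviour that reporting the additional statistics is meant to expose.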
