Thompson W D
Department of Applied Medical Sciences, University of Southern Maines, Portland 04103.
Epidemiology. 1990 Sep;1(5):357-69. doi: 10.1097/00001648-199009000-00005.
This paper explores the relation between reliability and validity for binary variables. An equation is first derived for expressing the kappa coefficient of reliability as a function of true prevalence, sensitivity, specificity, and the degree of correlation of errors within true positives and within true negatives. It is then shown that under certain circumstances kappa is closely related to an index of the validity of the odds ratio as a means for quantifying an association between an imperfectly classified binary factor and membership in two subpopulations. This index is defined as the ratio of the observed odds ratio minus one to the true odds ratio minus one. Numerical results indicate that, when error rates are high, kappa can be of smaller magnitude than this index of validity. When the value of kappa reflects substantial correlation of errors, however, the value of kappa typically exceeds that of the index of validity. Correlated errors among the true negatives are found to affect the correspondence between kappa and the index of validity rather differently than do correlated errors among the true positives. Although kappa often exceeds the index of validity, use of kappa to correct the odds ratio for attenuation owing to nondifferential misclassification generally leads to undercorrection rather than overcorrection. Consequently, the relation between kappa and the index of the validity of the odds ratio may have practical value for reducing bias in the estimation of exposure-disease associations in epidemiologic research.
本文探讨二元变量的可靠性与有效性之间的关系。首先推导了一个方程,用于将可靠性的kappa系数表示为真实患病率、敏感性、特异性以及真阳性和真阴性内部误差相关程度的函数。然后表明,在某些情况下,kappa与作为量化不完全分类二元因素与两个亚人群成员资格之间关联的一种手段的优势比有效性指数密切相关。该指数定义为观察到的优势比减1与真实优势比减1的比值。数值结果表明,当错误率较高时,kappa可能比该有效性指数的数值小。然而,当kappa值反映出误差的显著相关性时,kappa值通常超过有效性指数的值。发现真阴性之间的相关误差对kappa与有效性指数之间对应关系的影响与真阳性之间的相关误差有很大不同。尽管kappa常常超过有效性指数,但使用kappa来校正因非差异性错误分类导致的优势比衰减通常会导致校正不足而非校正过度。因此,kappa与优势比有效性指数之间的关系对于减少流行病学研究中暴露-疾病关联估计的偏差可能具有实际价值。