Guggenmoos-Holzmann I
Department of Biostatistics and Medical Informatics, Free University of Berlin, Germany.
Stat Med. 1993 Dec 15;12(23):2191-205. doi: 10.1002/sim.4780122305.
Chance-corrected measures of agreement are prone to exhibit paradoxical and counter-intuitive results when used as measures of reliability. It is demonstrated that these problems arise with Cohen's kappa as well as with Aickin's alpha. They are the consequence of an analogy to Simpson's paradox in mixed populations. It is further shown that chance-corrected measures of agreement may yield misleading values for binary ratings. It is concluded that improvements in the design and the analysis of reliability studies are a prerequisite for valid and pertinent results.
当用作可靠性度量时,一致性的机会校正度量容易出现矛盾和违反直觉的结果。结果表明,这些问题在科恩kappa系数以及艾金alpha系数中都会出现。它们是混合总体中与辛普森悖论类似情况的结果。进一步表明,一致性的机会校正度量对于二元评级可能会产生误导性的值。得出的结论是,可靠性研究的设计和分析的改进是获得有效和相关结果的先决条件。