Biometric Research Program, NCI, NIH, 9609 Medical Center Drive, Room 5W124, MSC 9735, Bethesda, MD 20892.
Molecular Imaging Program, NCI, NIH, Bethesda, Maryland.
Acad Radiol. 2018 Oct;25(10):1325-1332. doi: 10.1016/j.acra.2018.01.030. Epub 2018 Mar 16.
To point out the problems with Cohen kappa statistic and to explore alternative metrics to determine interobserver agreement on lesion detection when locations are not prespecified.
Use of kappa and two alternative methods, namely index of specific agreement (ISA) and modified kappa, for measuring interobserver agreement on the location of detected lesions are presented. These indices of agreement are illustrated by application to a retrospective multireader study in which nine readers detected and scored prostate cancer lesions in 163 consecutive patients (n = 110 cases, n = 53 controls) using the guideline of Prostate Imaging Reporting and Data System version 2 on multiparametric magnetic resonance imaging.
The proposed modified kappa, which properly corrects for the amount of agreement by chance, is shown to be approximately equivalent to the ISA. In the prostate cancer data, average kappa, modified kappa, and ISA equaled 30%, 55%, and 57%, respectively, for all lesions and 20%, 87%, and 87%, respectively, for index lesions.
The application of kappa could result in a substantial downward bias in reader agreement on lesion detection when locations are not prespecified. ISA is recommended for assessment of reader agreement on lesion detection.
指出 Cohen kappa 统计量的问题,并探索替代指标,以确定在未指定位置时检测病变的观察者间一致性。
介绍了 kappa 和两种替代方法(即特异性一致指数(ISA)和修正 kappa)在检测病变位置的观察者间一致性的应用。这些一致性指标通过应用于回顾性多读者研究来说明,该研究中,9 位读者使用前列腺影像报告和数据系统第 2 版指南在 163 例连续患者(n=110 例,n=53 例对照)中检测和评分前列腺癌病变,使用多参数磁共振成像。
提出的修正 kappa 适当校正了因偶然而产生的一致性量,结果表明与 ISA 大致相当。在前列腺癌数据中,所有病变的平均 kappa、修正 kappa 和 ISA 分别为 30%、55%和 57%,而指数病变分别为 20%、87%和 87%。
当位置未指定时,kappa 的应用可能导致对病变检测的观察者间一致性产生实质性的向下偏差。ISA 推荐用于评估病变检测的读者一致性。