Kundel Harold L, Polansky Marcia
Department of Radiology and MCP Hahnemann School of Public Health, University of Pennsylvania Medical Center, 3600 Market St, Suite 370, Philadelphia, PA 19104, USA.
Radiology. 2003 Aug;228(2):303-8. doi: 10.1148/radiol.2282011860. Epub 2003 Jun 20.
This review describes statistical measures used in diagnostic imaging to express observer agreement on categorical data. The measures are used to characterize the reliability of imaging methods and the reproducibility of disease classifications and, occasionally and with great care, as a surrogate for accuracy. The review concentrates on the chance-corrected indices, kappa and weighted kappa. Examples from the imaging literature illustrate the method of calculation and the effects of both disease prevalence and the number of rating categories. Other, less frequently used measures of agreement, including multiple-rater kappa, are referenced and described briefly.
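As a rough illustration of the chance-corrected indices named in the abstract, the sketch below computes kappa and linearly weighted kappa for two raters from a square agreement table, using the standard definition kappa = (p_o - p_e) / (1 - p_e). The function names, the linear weighting scheme, and all counts are assumptions made for illustration; none of them are taken from the article.

```python
def cohen_kappa(table, weights=None):
    """Chance-corrected agreement between two raters.

    table[i][j] is the number of cases that rater A placed in category i
    and rater B placed in category j. With weights=None this is the
    unweighted kappa; with an agreement-weight matrix it is weighted kappa.
    """
    k = len(table)
    n = sum(sum(row) for row in table)
    row_tot = [sum(row) for row in table]
    col_tot = [sum(table[i][j] for i in range(k)) for j in range(k)]
    if weights is None:
        # Identity weights: only exact agreement counts.
        weights = [[1.0 if i == j else 0.0 for j in range(k)] for i in range(k)]
    # Observed (weighted) proportion of agreement.
    p_o = sum(weights[i][j] * table[i][j]
              for i in range(k) for j in range(k)) / n
    # Agreement expected by chance from the marginal totals.
    p_e = sum(weights[i][j] * row_tot[i] * col_tot[j]
              for i in range(k) for j in range(k)) / (n * n)
    return (p_o - p_e) / (1 - p_e)


def linear_weights(k):
    """Agreement weights that fall off linearly with category distance."""
    return [[1 - abs(i - j) / (k - 1) for j in range(k)] for i in range(k)]


# Hypothetical counts, chosen only to illustrate the calculation.
balanced = [[40, 10], [5, 45]]      # disease present in about half the cases
rare = [[90, 4], [4, 2]]            # disease present in few cases
ordinal = [[20, 5, 0], [5, 20, 5], [0, 5, 20]]  # three ordered categories

print(round(cohen_kappa(balanced), 3))
print(round(cohen_kappa(rare), 3))
print(round(cohen_kappa(ordinal), 3))
print(round(cohen_kappa(ordinal, linear_weights(3)), 3))
```

In these made-up tables, the low-prevalence table has higher raw agreement (92% versus 85%) but a much lower kappa, which is one way the prevalence effect mentioned in the abstract can show up. In the three-category table all disagreements are between adjacent categories, so the linearly weighted kappa comes out higher than the unweighted value.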