Obuchowski Nancy A
Department of Quantitative Health Sciences/Wb4, The Cleveland Clinic Foundation, 9500 Euclid Ave, Cleveland, OH 44195, USA.
Acad Radiol. 2005 Sep;12(9):1198-204. doi: 10.1016/j.acra.2005.05.013.
Investigators often need to assess the accuracy of diagnostic tests when the gold standard is not binary. The objective of this article is to describe nonparametric estimators of diagnostic test accuracy when the gold standard is continuous, ordinal, or nominal scale.
A nonparametric method of estimating and comparing the area under receiver operating characteristic (ROC) curves, proposed by DeLong et al, is extended to situations in which the gold standard is not binary. Two examples illustrate the methods.
Measures of diagnostic test accuracy, their variances, and tests for comparing the accuracies of two diagnostic tests in paired designs are presented for situations in which the gold standard is continuous, ordinal, or nominal scale. These summary measures of diagnostic test accuracy are analogous in form and interpretation to the area under the ROC curve.
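To make the analogy with the area under the ROC curve concrete, the following is a minimal sketch of a pairwise concordance index for a continuous or ordinal gold standard. It is an illustration under stated assumptions, not necessarily the estimator defined in the article: the function name `concordance_index`, the rule of skipping pairs tied on the gold standard, and the half credit for tied test results are all choices made here. Under these choices, a binary gold standard reduces the index to the familiar Mann-Whitney estimate of the area under the ROC curve.

```python
from itertools import combinations


def concordance_index(test_results, gold_values):
    """Fraction of patient pairs that the test orders the same way as the gold standard.

    Assumptions of this sketch (not taken from the article):
    - pairs with equal gold standard values carry no ordering information
      and are skipped;
    - pairs with equal test results receive half credit.
    """
    if len(test_results) != len(gold_values):
        raise ValueError("test_results and gold_values must have the same length")

    score = 0.0
    usable_pairs = 0
    for (x_i, t_i), (x_j, t_j) in combinations(zip(test_results, gold_values), 2):
        if t_i == t_j:
            continue  # tied on the gold standard: pair is not usable
        usable_pairs += 1
        if x_i == x_j:
            score += 0.5  # tied test results: half credit
        elif (x_i > x_j) == (t_i > t_j):
            score += 1.0  # test and gold standard agree on the ordering

    if usable_pairs == 0:
        raise ValueError("no pair of patients differs on the gold standard")
    return score / usable_pairs


# Hypothetical data: four patients, test result and continuous gold standard.
print(concordance_index([0.1, 0.4, 0.35, 0.8], [1.2, 2.5, 3.1, 4.0]))
```

As with the area under the ROC curve, a value of 0.5 corresponds to a test that orders patients no better than chance, and 1.0 to a test that orders every usable pair correctly. The sketch gives only the point estimate; the variance and the paired comparison of two tests are not shown.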
Dichotomizing the outcomes of a gold standard so that traditional ROC methods can be applied can lead to bias. The methods described here are useful for assessing and comparing summary test accuracy when the gold standard is not binary. Their limitations are similar to those of other summary indices.