Suppr
超能文献

多夫曼-贝鲍姆-梅茨法与奥布霍夫斯基-罗凯特法用于接受者操作特征（ROC）数据的比较。

A comparison of the Dorfman-Berbaum-Metz and Obuchowski-Rockette methods for receiver operating characteristic (ROC) data.

作者信息

Hillis Stephen L, Obuchowski Nancy A, Schartz Kevin M, Berbaum Kevin S

机构信息

Center for Research in the Implementation of Innovative Strategies in Practice, Iowa City VA Medical Center, Iowa City, IA, USA.

出版信息

Stat Med. 2005 May 30;24(10):1579-607. doi: 10.1002/sim.2024.

DOI:10.1002/sim.2024

PMID:15685718

Abstract

There are several different statistical methods for analysing multireader ROC studies, with the Dorfman-Berbaum-Metz (DBM) method being the most frequently used. Another method is the corrected F method proposed by Obuchowski and Rockette (OR). The DBM and OR procedures at first appear quite different: DBM is a three-way ANOVA analysis of pseudovalues while OR is a two-way ANOVA analysis of accuracy estimates with correlated errors. We show that the original DBM and OR F statistics for testing the null hypothesis of equal treatments have the same form and will typically have similar values; however, differences in the denominator degrees of freedom will result in differences in p-values even when the F statistics are identical. We show how the methods can be generalized to include variations in the accuracy measure, covariance method, and degrees of freedom. Identical results are obtained when the methods agree with respect to all three of these procedure parameters; hence for a particular choice of procedure parameters the choice of method appears to depend mainly on software preference and availability. The methods are compared using data from a factorial study with two modalities, five readers, and 114 patients.

摘要

有几种不同的统计方法可用于分析多读者ROC研究，其中Dorfman-Berbaum-Metz（DBM）方法是最常用的。另一种方法是Obuchowski和Rockette（OR）提出的校正F方法。DBM和OR程序乍一看有很大不同：DBM是对伪值的三向方差分析，而OR是对具有相关误差的准确性估计的双向方差分析。我们表明，用于检验处理相等的原假设的原始DBM和OR F统计量具有相同的形式，并且通常会有相似的值；然而，分母自由度的差异即使在F统计量相同时也会导致p值的差异。我们展示了如何将这些方法推广到包括准确性度量、协方差方法和自由度的变化。当这些方法在所有这三个程序参数方面一致时，会得到相同的结果；因此，对于特定的程序参数选择，方法的选择似乎主要取决于软件偏好和可用性。使用来自一项析因研究的数据对这些方法进行比较，该研究涉及两种模式、五位读者和114名患者。