Roe C A, Metz C E
Department of Radiology, University of Chicago Medical Center, IL 60637-1470, USA.
Acad Radiol. 1997 Apr;4(4):298-303. doi: 10.1016/s1076-6332(97)80032-3.
The authors examined the relationship between the critical P value (alpha) and the empirical type I error rate when using the Dorfman-Berbaum-Metz (DMB) method for analysis of variance in multireader, multimodality receiver operating characteristic (ROC) data.
The authors developed a linear mixed-effect model to generate continuous, normally distributed random decision variables containing multiple sources (components) of variation. A range of magnitudes for these variance components was used to stimulate experiments in which multiple readers (three or five) read imaged obtained with two modalities from the same set of cases with no re-reading. Three binormal population ROC curves, with areas of 0.962, 0.855, and 0.702, were included. Case-sample sizes ranged from 50 to 400, and either 50% or 10% of cases were actually positive. For each experiment, 2,000 data sets were analyzed by the computer program, and the proportion of 2,000 modality differences that was found to be statistically significant at an alpha level of .05 was tubulated.
The test for modality difference performed well for the low and intermediate ROC curves, even with small case samples. For the high ROC curve, the small-sample results were conservative. No relationship between observed type I error rate and the magnitude of data correlation was evident.
For typical ROC curves, the DBM method is robust in testing for modality effects in the null case, given a sufficient sample size. Instructions for obtaining a free copy of the software are given.
作者研究了在多读者、多模态接收器操作特性(ROC)数据的方差分析中使用多夫曼 - 伯鲍姆 - 梅茨(DMB)方法时,临界P值(α)与实际I型错误率之间的关系。
作者开发了一个线性混合效应模型,以生成包含多个变异源(成分)的连续、正态分布的随机决策变量。这些方差成分的一系列大小被用于模拟实验,其中多个读者(三个或五个)读取从同一组病例中通过两种模态获得的图像,且不进行重新读取。纳入了三条双正态总体ROC曲线,其面积分别为0.962、0.855和0.702。病例样本量从50到400不等,且50%或10%的病例实际为阳性。对于每个实验,计算机程序分析2000个数据集,并将在α水平为0.05时发现具有统计学显著性的2000个模态差异的比例制成表格。
即使病例样本量较小,对于低和中等ROC曲线,模态差异检验效果良好。对于高ROC曲线,小样本结果较为保守。未观察到实际I型错误率与数据相关性大小之间存在明显关系。
对于典型的ROC曲线,在样本量充足的情况下,DBM方法在检验无效假设情况下的模态效应时具有稳健性。给出了获取该软件免费副本的说明。