Gefeller O, Brenner H
Department of Medical Statistics, University of Göttingen, Germany.
Methods Inf Med. 1994 May;33(2):180-6.
The traditional concept of describing the validity of a diagnostic test neglects the presence of chance agreement between test result and true (disease) status. Sensitivity and specificity, as the fundamental measures of validity, can thus only be considered in conjunction with each other to provide an appropriate basis for the evaluation of the capacity of the test to discriminate truly diseased from truly undiseased subjects. In this paper, chance-corrected analogues of sensitivity and specificity are presented as supplemental measures of validity, which pay attention to the problem of chance agreement and offer the opportunity to be interpreted separately. While recent proposals of chance-correction techniques, suggested by several authors in this context, lead to measures which are dependent on disease prevalence, our method does not share this major disadvantage. We discuss the extension of the conventional ROC-curve approach to chance-corrected measures of sensitivity and specificity. Furthermore, point and asymptotic interval estimates of the parameters of interest are derived under different sampling frameworks for validation studies. The small sample behavior of the estimates is investigated in a simulation study, leading to a logarithmic modification of the interval estimate in order to hold the nominal confidence level for small samples.
描述诊断试验有效性的传统概念忽略了试验结果与真实(疾病)状态之间偶然一致性的存在。因此,作为有效性基本指标的灵敏度和特异度,只能相互结合起来考虑,以便为评估该试验区分真正患病和真正未患病个体的能力提供适当依据。本文提出了灵敏度和特异度的机遇校正类似指标,作为有效性的补充指标,它们关注偶然一致性问题,并提供了单独解释的机会。虽然在这方面有几位作者最近提出的机遇校正技术建议会得出依赖于疾病患病率的指标,但我们的方法不存在这一主要缺点。我们讨论了将传统的ROC曲线方法扩展到灵敏度和特异度的机遇校正指标。此外,还在不同的抽样框架下推导了验证研究中感兴趣参数的点估计和渐近区间估计。通过模拟研究考察了估计量的小样本行为,从而对区间估计进行对数修正,以保持小样本下的名义置信水平。