Valenstein P N
Department of Pathology, State University of New York, Stony Brook.
Am J Clin Pathol. 1990 Feb;93(2):252-8. doi: 10.1093/ajcp/93.2.252.
New diagnostic tests frequently are evaluated against gold standards that are assumed to classify patients with unerring accuracy according to the presence or absence of disease. In practice, gold standards rarely are perfect predictors of disease and tend to misclassify a small number of patients. When an imperfect standard is used to evaluate a diagnostic test, many commonly used measures of test performance are distorted. It is not widely appreciated that these distortions occur in predictable directions and that they may be of considerable magnitude, even when the gold standard has a high degree of accuracy. The diagnostic powers of clinical tests will be more accurately reported if consideration is given to the types of biases that result from the use of imperfect standards. Several different approaches may be used to minimize these distortions when evaluating new tests.
新的诊断测试常常对照金标准进行评估,这些金标准被假定能根据疾病的有无以无误的准确性对患者进行分类。实际上,金标准很少是疾病的完美预测指标,往往会将少数患者误分类。当使用不完美的标准来评估诊断测试时,许多常用的测试性能指标会被扭曲。人们并未广泛认识到这些扭曲会以可预测的方向出现,而且即便金标准具有高度准确性,其扭曲程度也可能相当大。如果考虑到使用不完美标准所导致的偏差类型,临床测试的诊断能力将能得到更准确的报告。在评估新测试时,可以采用几种不同的方法来尽量减少这些扭曲。