Department of Quantitative Health Sciences/JJN3 and the Imaging Institute, Cleveland Clinic Foundation, 9500 Euclid Ave, Cleveland, OH 44195, USA.
Eur Radiol. 2010 Mar;20(3):584-94. doi: 10.1007/s00330-009-1590-4. Epub 2009 Sep 16.
Sensitivity and the false positive rate are usually defined with the patient as the unit of observation, i.e., the diagnostic test detects or does not detect disease in a patient. For tests designed to find and diagnose lesions, e.g., lung nodules, the usual definitions of sensitivity and specificity may be misleading. In this paper we describe and compare five measures of accuracy of lesion detection.
The five levels of evaluation considered were patient level without localization, patient level with localization, region of interest (ROI) level without localization, ROI level with localization, and lesion level.
We found that estimators of sensitivity that do not require the reader to correctly locate the lesion overstate sensitivity. Patient-level estimators of sensitivity can be misleading when there is more than one lesion per patient and they reduce study power. Patient-level estimators of the false positive rate can conceal important differences between techniques. Referring clinicians rely on a test's reported accuracy to both choose the appropriate test and plan management for their patients. If reported sensitivity is overstated, the clinician could choose the test for disease screening, and have false confidence that a negative test represents the true absence of lesions. Similarly, the lower false positive rate associated with patient-level estimators can mislead clinicians about the diagnostic value of the test and consequently that a positive finding is real.
We present clear recommendations for studies assessing and comparing the accuracy of tests tasked with the detection and interpretation of lesions...
通常以患者为观察单位来定义敏感性和假阳性率,即诊断试验在患者中检测或未检测到疾病。对于旨在发现和诊断病变的测试,例如肺结节,通常的敏感性和特异性定义可能会产生误导。在本文中,我们描述并比较了五种病变检测准确性的度量方法。
考虑了五个评估级别,分别是未定位的患者级别、定位的患者级别、无定位的感兴趣区域 (ROI) 级别、定位的 ROI 级别和病变级别。
我们发现,不需要读者正确定位病变的敏感性估计值过高估计了敏感性。当每位患者有多个病变时,患者水平的敏感性估计值可能会产生误导,并且会降低研究的效力。患者水平的假阳性率估计值可能会掩盖技术之间的重要差异。参考临床医生依赖测试报告的准确性来选择适当的测试,并为患者制定管理计划。如果报告的敏感性过高,临床医生可能会选择用于疾病筛查的测试,并错误地认为阴性测试代表病变确实不存在。同样,与患者水平的估计值相关的较低的假阳性率可能会使临床医生对测试的诊断价值产生误解,从而导致阳性结果不真实。
我们提出了明确的建议,用于评估和比较旨在检测和解释病变的测试的准确性……