病变检测的患者水平分析中的偏差、风险低估和统计效力损失。

Bias, underestimation of risk, and loss of statistical power in patient-level analyses of lesion detection.

机构信息

Department of Quantitative Health Sciences/JJN3 and the Imaging Institute, Cleveland Clinic Foundation, 9500 Euclid Ave, Cleveland, OH 44195, USA.

出版信息

Eur Radiol. 2010 Mar;20(3):584-94. doi: 10.1007/s00330-009-1590-4. Epub 2009 Sep 16.

DOI:10.1007/s00330-009-1590-4

PMID:19763582

Abstract

PURPOSE

Sensitivity and the false positive rate are usually defined with the patient as the unit of observation, i.e., the diagnostic test detects or does not detect disease in a patient. For tests designed to find and diagnose lesions, e.g., lung nodules, the usual definitions of sensitivity and specificity may be misleading. In this paper we describe and compare five measures of accuracy of lesion detection.

METHODS

The five levels of evaluation considered were patient level without localization, patient level with localization, region of interest (ROI) level without localization, ROI level with localization, and lesion level.

RESULTS

We found that estimators of sensitivity that do not require the reader to correctly locate the lesion overstate sensitivity. Patient-level estimators of sensitivity can be misleading when there is more than one lesion per patient and they reduce study power. Patient-level estimators of the false positive rate can conceal important differences between techniques. Referring clinicians rely on a test's reported accuracy to both choose the appropriate test and plan management for their patients. If reported sensitivity is overstated, the clinician could choose the test for disease screening, and have false confidence that a negative test represents the true absence of lesions. Similarly, the lower false positive rate associated with patient-level estimators can mislead clinicians about the diagnostic value of the test and consequently that a positive finding is real.

CONCLUSION

We present clear recommendations for studies assessing and comparing the accuracy of tests tasked with the detection and interpretation of lesions...

摘要

目的

通常以患者为观察单位来定义敏感性和假阳性率，即诊断试验在患者中检测或未检测到疾病。对于旨在发现和诊断病变的测试，例如肺结节，通常的敏感性和特异性定义可能会产生误导。在本文中，我们描述并比较了五种病变检测准确性的度量方法。

方法

考虑了五个评估级别，分别是未定位的患者级别、定位的患者级别、无定位的感兴趣区域 (ROI) 级别、定位的 ROI 级别和病变级别。

结果

我们发现，不需要读者正确定位病变的敏感性估计值过高估计了敏感性。当每位患者有多个病变时，患者水平的敏感性估计值可能会产生误导，并且会降低研究的效力。患者水平的假阳性率估计值可能会掩盖技术之间的重要差异。参考临床医生依赖测试报告的准确性来选择适当的测试，并为患者制定管理计划。如果报告的敏感性过高，临床医生可能会选择用于疾病筛查的测试，并错误地认为阴性测试代表病变确实不存在。同样，与患者水平的估计值相关的较低的假阳性率可能会使临床医生对测试的诊断价值产生误解，从而导致阳性结果不真实。

结论

我们提出了明确的建议，用于评估和比较旨在检测和解释病变的测试的准确性……

相似文献

Bias, underestimation of risk, and loss of statistical power in patient-level analyses of lesion detection.

Eur Radiol. 2010 Mar;20(3):584-94. doi: 10.1007/s00330-009-1590-4. Epub 2009 Sep 16.

Data analysis for detection and localization of multiple abnormalities with application to mammography.

Acad Radiol. 2000 Jul;7(7):516-25. doi: 10.1016/s1076-6332(00)80324-4.

Assessing operating characteristics of CAD algorithms in the absence of a gold standard.

Med Phys. 2010 Apr;37(4):1788-95. doi: 10.1118/1.3352687.

Massive training artificial neural network (MTANN) for reduction of false positives in computerized detection of lung nodules in low-dose computed tomography.

Med Phys. 2003 Jul;30(7):1602-17. doi: 10.1118/1.1580485.

The usefulness of 99mTc-tetrofosmin SPECT in the detection of lung metastases from extrapulmonary primary tumors.

Radiol Med. 2004 Jan-Feb;107(1-2):113-27.

Computer-assisted assessment of ultrasound real-time elastography: initial experience in 145 breast lesions.

Eur J Radiol. 2014 Jan;83(1):e1-7. doi: 10.1016/j.ejrad.2013.09.009. Epub 2013 Sep 23.

Screening for disease: making evidence-based choices.

Clin J Oncol Nurs. 2006 Feb;10(1):73-6. doi: 10.1188/06.CJON.73-76.

引用本文的文献

Breast MRI in the era of diffusion weighted imaging: do we still need signal-intensity time curves?

Eur Radiol. 2020 Jan;30(1):47-56. doi: 10.1007/s00330-019-06346-x. Epub 2019 Jul 29.

Breast lesion detection and characterization with contrast-enhanced magnetic resonance imaging: Prospective randomized intraindividual comparison of gadoterate meglumine (0.15 mmol/kg) and gadobenate dimeglumine (0.075 mmol/kg) at 3T.

J Magn Reson Imaging. 2019 Apr;49(4):1157-1165. doi: 10.1002/jmri.26335. Epub 2018 Dec 15.

MRI for the assessment of malignancy in BI-RADS 4 mammographic microcalcifications.

PLoS One. 2017 Nov 30;12(11):e0188679. doi: 10.1371/journal.pone.0188679. eCollection 2017.

ROC or FROC? It depends on the research question.

Med Phys. 2017 May;44(5):1603-1606. doi: 10.1002/mp.12151. Epub 2017 Mar 17.

Preoperative axillary lymph node evaluation in breast cancer patients by breast magnetic resonance imaging (MRI): Can breast MRI exclude advanced nodal disease?

Eur Radiol. 2016 Nov;26(11):3865-3873. doi: 10.1007/s00330-016-4235-4. Epub 2016 Feb 2.

Comparison of the diagnostic performance of digital breast tomosynthesis and magnetic resonance imaging added to digital mammography in women with known breast cancers.

Eur Radiol. 2016 Jun;26(6):1556-64. doi: 10.1007/s00330-015-3998-3. Epub 2015 Sep 16.

Diffusion-weighted MRI for uveal melanoma liver metastasis detection.

Eur Radiol. 2015 Aug;25(8):2263-73. doi: 10.1007/s00330-015-3662-y. Epub 2015 Feb 26.

Diffusion-weighted and T2-weighted MR imaging for colorectal liver metastases detection in a rat model at 7 T: a comparative study using histological examination as reference.

Eur Radiol. 2013 Aug;23(8):2156-64. doi: 10.1007/s00330-013-2789-y. Epub 2013 Mar 2.

Bilateral contrast-enhanced dual-energy digital mammography: feasibility and comparison with conventional digital mammography and MR imaging in women with known breast carcinoma.

Radiology. 2013 Mar;266(3):743-51. doi: 10.1148/radiol.12121084. Epub 2012 Dec 6.

Detection of noncalcified pulmonary nodules on low-dose MDCT: comparison of the sensitivity of two CAD systems by using a double reference standard.

Radiol Med. 2012 Sep;117(6):953-67. doi: 10.1007/s11547-012-0795-9. Epub 2012 Feb 10.

本文引用的文献

On comparing methods for discriminating between actually negative and actually positive subjects with FROC type data.

Med Phys. 2008 Apr;35(4):1547-58. doi: 10.1118/1.2890410.

Assessment of medical imaging systems and computer aids: a tutorial review.

Acad Radiol. 2007 Jun;14(6):723-48. doi: 10.1016/j.acra.2007.03.001.

Analysis of location specific observer performance data: validated extensions of the jackknife free-response (JAFROC) method.

Acad Radiol. 2006 Oct;13(10):1187-93. doi: 10.1016/j.acra.2006.06.016.

ROC curves predicted by a model of visual search.

Phys Med Biol. 2006 Jul 21;51(14):3463-82. doi: 10.1088/0031-9155/51/14/013. Epub 2006 Jul 6.

A search model and figure of merit for observer data acquired according to the free-response paradigm.

Phys Med Biol. 2006 Jul 21;51(14):3449-62. doi: 10.1088/0031-9155/51/14/012. Epub 2006 Jul 6.

Observer studies involving detection and localization: modeling, analysis, and validation.

Med Phys. 2004 Aug;31(8):2313-30. doi: 10.1118/1.1769352.

Location of adenomas missed by optical colonoscopy.

Ann Intern Med. 2004 Sep 7;141(5):352-9. doi: 10.7326/0003-4819-141-5-200409070-00009.

Maximum likelihood fitting of FROC curves under an initial-detection-and-candidate-analysis model.

Med Phys. 2002 Dec;29(12):2861-70. doi: 10.1118/1.1524631.

Data analysis for detection and localization of multiple abnormalities with application to mammography.

Acad Radiol. 2000 Jul;7(7):516-25. doi: 10.1016/s1076-6332(00)80324-4.

Bootstrap estimation of diagnostic accuracy with patient-clustered data.

Acad Radiol. 2000 Jun;7(6):413-9. doi: 10.1016/s1076-6332(00)80381-5.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

病变检测的患者水平分析中的偏差、风险低估和统计效力损失。

Bias, underestimation of risk, and loss of statistical power in patient-level analyses of lesion detection.

机构信息

出版信息

PURPOSE

METHODS

RESULTS

CONCLUSION

目的

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献