Li Qiang
Department of Radiology, University of Chicago, Chicago, IL 60637, USA.
Acad Radiol. 2007 Aug;14(8):985-91. doi: 10.1016/j.acra.2007.04.015.
Computer-aided diagnostic (CAD) schemes have been developed for assisting radiologists in the detection of various lesions in medical images. The reliable evaluation of CAD schemes is an important task in the field of CAD research.
Many approaches have been proposed for evaluating the performance of CAD schemes. However, some important issues in the evaluation of CAD schemes have not been systematically analyzed. The first is the analysis and comparison of evaluation methods in terms of certain characteristics. The second is the analysis of pitfalls in the incorrect use of evaluation methods, together with effective approaches to reducing the bias and variance caused by these pitfalls. We address the first issue in detail in this article by conducting Monte Carlo simulation experiments, and we discuss the second issue in the Discussion section.
No single evaluation method is universally superior to the others; different situations of CAD applications require different evaluation methods, as recommended in this article. Bias and variance in the estimated performance levels caused by various pitfalls can be reduced considerably by the correct use of good evaluation methods.
This article should be useful to CAD researchers for selecting appropriate evaluation methods and for improving the reliability of the estimated performance of their CAD schemes.
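As a rough illustration of the kind of Monte Carlo experiment mentioned above, the sketch below estimates the bias and spread of two evaluation methods on simulated data. Everything specific in it is an assumption made for illustration and is not taken from the article: the two methods compared (resubstitution and holdout), the Gaussian feature model, the nearest-mean classifier, the parameter values, and all function names. Because the simulated population is known, the true accuracy of each trained classifier can be computed exactly and compared with each estimate.

```python
import math
import random
import statistics

DIM = 5      # number of features per case (assumed for illustration)
DELTA = 1.0  # separation of the class means along feature 0 (assumed)


def normal_cdf(z):
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def draw_cases(n, label, rng):
    """Draw n unit-variance Gaussian feature vectors; class 1 is shifted on feature 0."""
    shift = DELTA if label == 1 else 0.0
    return [[rng.gauss(shift if j == 0 else 0.0, 1.0) for j in range(DIM)]
            for _ in range(n)]


def train(neg, pos):
    """Nearest-mean classifier: predict class 1 when w.x > b."""
    m0 = [sum(col) / len(col) for col in zip(*neg)]
    m1 = [sum(col) / len(col) for col in zip(*pos)]
    w = [p - q for p, q in zip(m1, m0)]
    b = sum(wi * (p + q) / 2.0 for wi, p, q in zip(w, m0, m1))
    return w, b


def accuracy(model, neg, pos):
    """Fraction of the given cases that the model classifies correctly."""
    w, b = model

    def score(x):
        return sum(wi * xi for wi, xi in zip(w, x))

    correct = sum(score(x) <= b for x in neg) + sum(score(x) > b for x in pos)
    return correct / (len(neg) + len(pos))


def true_accuracy(model):
    """Exact population accuracy with equal class priors (known distributions)."""
    w, b = model
    norm = math.sqrt(sum(wi * wi for wi in w))
    specificity = normal_cdf(b / norm)                       # class 0: w.x ~ N(0, norm^2)
    sensitivity = 1.0 - normal_cdf((b - w[0] * DELTA) / norm)  # class 1 mean is w[0]*DELTA
    return 0.5 * (specificity + sensitivity)


def simulate(n_per_class=10, trials=2000, seed=0):
    """Return {method: (mean error, std of error)} of each estimate vs. the truth."""
    rng = random.Random(seed)
    errors = {"resubstitution": [], "holdout": []}
    half = n_per_class // 2
    for _ in range(trials):
        neg = draw_cases(n_per_class, 0, rng)
        pos = draw_cases(n_per_class, 1, rng)
        # Reference: true accuracy of the classifier trained on all cases.
        full_model = train(neg, pos)
        truth = true_accuracy(full_model)
        # Resubstitution: test on the same cases used for training.
        errors["resubstitution"].append(accuracy(full_model, neg, pos) - truth)
        # Holdout: train on the first half, test on the second half.
        half_model = train(neg[:half], pos[:half])
        errors["holdout"].append(
            accuracy(half_model, neg[half:], pos[half:]) - truth)
    return {name: (statistics.mean(e), statistics.stdev(e))
            for name, e in errors.items()}


if __name__ == "__main__":
    for name, (bias, spread) in simulate().items():
        print(f"{name}: bias={bias:+.3f}, std={spread:.3f}")
```

In this toy setting, resubstitution should tend to overestimate accuracy because the classifier is tested on its own training cases, while holdout should show a wider spread because it tests on fewer cases. The numbers depend entirely on the assumed settings and say nothing about the article's own results.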