College of Computing and Digital Media, DePaul University, 243 S. Wabash Avenue, Chicago, IL 60604, USA.
J Digit Imaging. 2012 Jun;25(3):423-36. doi: 10.1007/s10278-011-9445-3.
Traditionally, image studies evaluating the effectiveness of computer-aided diagnosis (CAD) use a single label from a medical expert compared with a single label produced by CAD. The purpose of this research is to present a CAD system based on Belief Decision Tree classification algorithm, capable of learning from probabilistic input (based on intra-reader variability) and providing probabilistic output. We compared our approach against a traditional decision tree approach with respect to a traditional performance metric (accuracy) and a probabilistic one (area under the distance-threshold curve-AuC(dt)). The probabilistic classification technique showed notable performance improvement in comparison with the traditional one with respect to both evaluation metrics. Specifically, when applying cross-validation technique on the training subset of instances, boosts of 28.26% and 30.28% were noted for the probabilistic approach with respect to accuracy and AuC(dt), respectively. Furthermore, on the validation subset of instances, boosts of 20.64% and 23.21% were noted again for the probabilistic approach with respect to the same two metrics. In addition, we compared our CAD system results with diagnostic data available for a small subset of the Lung Image Database Consortium database. We discovered that when our CAD system errs, it generally does so with low confidence. Predictions produced by the system also agree with diagnoses of truly benign nodules more often than radiologists, offering the possibility of reducing the false positives.
传统上,评估计算机辅助诊断 (CAD) 有效性的图像研究使用医学专家的单一标签与 CAD 生成的单一标签进行比较。本研究的目的是提出一种基于置信度决策树分类算法的 CAD 系统,该系统能够从概率输入(基于读者内变异性)中学习,并提供概率输出。我们将我们的方法与传统的决策树方法进行了比较,比较了传统的性能指标(准确性)和概率指标(距离-阈值曲线下的面积-AuC(dt))。与传统方法相比,概率分类技术在这两个评估指标上都表现出了显著的性能改进。具体来说,在训练实例子集上应用交叉验证技术时,与准确性和 AuC(dt)相比,概率方法的准确性分别提高了 28.26%和 30.28%。此外,在验证实例子集上,与这两个相同的指标相比,概率方法再次提高了 20.64%和 23.21%。此外,我们还将我们的 CAD 系统结果与 Lung Image Database Consortium 数据库的一小部分诊断数据进行了比较。我们发现,当我们的 CAD 系统出错时,它通常是低可信度的。该系统生成的预测与真正良性结节的诊断也更为一致,从而有可能减少假阳性。