Barufaldi B, Gomes J V, Filho Tm Silva, do Rêgo T G, Malheiros Y, Vent T L, Gastounioti A, Maidment Ada
Department of Radiology, University of Pennsylvania, 3640 Hamilton Walk, PA 19104, United States.
Center of Informatics, Federal University of Paraíba, Rua dos Escoteiros s/n, João Pessoa, PB 58058-600, Brazil.
Pattern Recognit. 2024 Sep;153. doi: 10.1016/j.patcog.2024.110494. Epub 2024 Apr 11.
The adoption of artificial intelligence (AI) in medical imaging requires careful evaluation of machine-learning algorithms. We propose the use of a "deep virtual clinical trial" (DeepVCT) method to effectively evaluate the performance of AI algorithms. In this paper, DeepVCTs have been proposed to elucidate limitations of AI applications and predictions of clinical outcomes, avoiding biases in study designs. The DeepVCT method was used to evaluate the performance of nnU-Net models in assessing volumetric breast density (VBD) from digital breast tomosynthesis (DBT) images. In total, 2,010 anatomical breast models were simulated. Projections were simulated using the acquisition geometry of a clinical DBT system. The projections were reconstructed using 0.1, 0.2, and 0.5 mm plane spacing. nnU-Net models were developed using the center-most planes of the reconstructions with the respective ground-truth. The results show that the accuracy of the nnU-Net improves significantly with DBT images reconstructed with 0.1 mm plane spacing (78.4×205.3×40.1 mm). The segmentations resulted in Dice values up to 0.84 with area under the receiver operating characteristic curve of 0.92. The optimization of plane spacing for VBD assessment was used as an exemplar of a DeepVCT application, allowing us to interpret better the input parameters and outcomes of the nnU-Net. Thus, DeepVCTs can provide a plethora of evidence to predict the efficacy of these algorithms using large-scale simulation-based data.
在医学成像中采用人工智能(AI)需要对机器学习算法进行仔细评估。我们建议使用“深度虚拟临床试验”(DeepVCT)方法来有效评估AI算法的性能。在本文中,已提出使用DeepVCT来阐明AI应用的局限性和临床结果预测,避免研究设计中的偏差。DeepVCT方法用于评估nnU-Net模型在从数字乳腺断层合成(DBT)图像评估乳腺体积密度(VBD)方面的性能。总共模拟了2010个乳腺解剖模型。使用临床DBT系统的采集几何结构模拟投影。使用0.1、0.2和0.5毫米的平面间距重建投影。使用重建的最中心平面以及各自的真实情况开发nnU-Net模型。结果表明,对于以0.1毫米平面间距(78.4×205.3×40.1毫米)重建的DBT图像,nnU-Net的准确性显著提高。分割结果的Dice值高达0.84,受试者工作特征曲线下面积为0.92。将VBD评估的平面间距优化用作DeepVCT应用的一个示例,使我们能够更好地解释nnU-Net的输入参数和结果。因此,DeepVCT可以提供大量证据,以使用基于大规模模拟的数据预测这些算法的疗效。