Hwang InChan, Trivedi Hari, Brown-Mulry Beatrice, Zhang Linglin, Nalla Vineela, Gastounioti Aimilia, Gichoya Judy, Seyyed-Kalantari Laleh, Banerjee Imon, Woo MinJae
School of Data Science and Analytics, Kennesaw State University, Kennesaw, GA, United States.
Department of Radiology, Emory University, Atlanta, GA, United States.
Front Radiol. 2023 Jun 16;3:1181190. doi: 10.3389/fradi.2023.1181190. eCollection 2023.
To date, most mammography-related AI models have been trained using either film or digital mammogram datasets with little overlap. We investigated whether or not combining film and digital mammography during training will help or hinder modern models designed for use on digital mammograms.
To this end, a total of six binary classifiers were trained for comparison. The first three classifiers were trained using images only from Emory Breast Imaging Dataset (EMBED) using ResNet50, ResNet101, and ResNet152 architectures. The next three classifiers were trained using images from EMBED, Curated Breast Imaging Subset of Digital Database for Screening Mammography (CBIS-DDSM), and Digital Database for Screening Mammography (DDSM) datasets. All six models were tested only on digital mammograms from EMBED.
The results showed that performance degradation to the customized ResNet models was statistically significant overall when EMBED dataset was augmented with CBIS-DDSM/DDSM. While the performance degradation was observed in all racial subgroups, some races are subject to more severe performance drop as compared to other races.
The degradation may potentially be due to ( 1) a mismatch in features between film-based and digital mammograms ( 2) a mismatch in pathologic and radiological information. In conclusion, use of both film and digital mammography during training may hinder modern models designed for breast cancer screening. Caution is required when combining film-based and digital mammograms or when utilizing pathologic and radiological information simultaneously.
迄今为止,大多数与乳腺钼靶相关的人工智能模型都是使用胶片或数字乳腺钼靶数据集进行训练的,两者几乎没有重叠。我们研究了在训练过程中结合胶片和数字乳腺钼靶是否会有助于或阻碍为数字乳腺钼靶设计的现代模型。
为此,总共训练了六个二元分类器进行比较。前三个分类器使用仅来自埃默里乳腺影像数据集(EMBED)的图像,采用ResNet50、ResNet101和ResNet152架构进行训练。接下来的三个分类器使用来自EMBED、数字乳腺钼靶筛查数据库(DDSM)的精选乳腺影像子集(CBIS-DDSM)和DDSM数据集的图像进行训练。所有六个模型仅在来自EMBED的数字乳腺钼靶上进行测试。
结果表明,当EMBED数据集增加CBIS-DDSM/DDSM时,定制的ResNet模型的性能总体下降具有统计学意义。虽然在所有种族亚组中都观察到了性能下降,但与其他种族相比,某些种族的性能下降更为严重。
性能下降可能潜在地归因于(1)基于胶片的和数字乳腺钼靶之间的特征不匹配(2)病理和放射学信息的不匹配。总之,在训练过程中同时使用胶片和数字乳腺钼靶可能会阻碍为乳腺癌筛查设计的现代模型。在结合基于胶片的和数字乳腺钼靶或同时利用病理和放射学信息时需要谨慎。