多源数据增强对用于乳腺钼靶异常分类的卷积神经网络性能的影响。

Impact of multi-source data augmentation on performance of convolutional neural networks for abnormality classification in mammography.

作者信息

Hwang InChan, Trivedi Hari, Brown-Mulry Beatrice, Zhang Linglin, Nalla Vineela, Gastounioti Aimilia, Gichoya Judy, Seyyed-Kalantari Laleh, Banerjee Imon, Woo MinJae

机构信息

School of Data Science and Analytics, Kennesaw State University, Kennesaw, GA, United States.

Department of Radiology, Emory University, Atlanta, GA, United States.

出版信息

Front Radiol. 2023 Jun 16;3:1181190. doi: 10.3389/fradi.2023.1181190. eCollection 2023.

DOI:10.3389/fradi.2023.1181190

PMID:37588666

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10426498/

Abstract

INTRODUCTION

To date, most mammography-related AI models have been trained using either film or digital mammogram datasets with little overlap. We investigated whether or not combining film and digital mammography during training will help or hinder modern models designed for use on digital mammograms.

METHODS

To this end, a total of six binary classifiers were trained for comparison. The first three classifiers were trained using images only from Emory Breast Imaging Dataset (EMBED) using ResNet50, ResNet101, and ResNet152 architectures. The next three classifiers were trained using images from EMBED, Curated Breast Imaging Subset of Digital Database for Screening Mammography (CBIS-DDSM), and Digital Database for Screening Mammography (DDSM) datasets. All six models were tested only on digital mammograms from EMBED.

RESULTS

The results showed that performance degradation to the customized ResNet models was statistically significant overall when EMBED dataset was augmented with CBIS-DDSM/DDSM. While the performance degradation was observed in all racial subgroups, some races are subject to more severe performance drop as compared to other races.

DISCUSSION

The degradation may potentially be due to ( 1) a mismatch in features between film-based and digital mammograms ( 2) a mismatch in pathologic and radiological information. In conclusion, use of both film and digital mammography during training may hinder modern models designed for breast cancer screening. Caution is required when combining film-based and digital mammograms or when utilizing pathologic and radiological information simultaneously.

摘要

引言

迄今为止，大多数与乳腺钼靶相关的人工智能模型都是使用胶片或数字乳腺钼靶数据集进行训练的，两者几乎没有重叠。我们研究了在训练过程中结合胶片和数字乳腺钼靶是否会有助于或阻碍为数字乳腺钼靶设计的现代模型。

方法

为此，总共训练了六个二元分类器进行比较。前三个分类器使用仅来自埃默里乳腺影像数据集（EMBED）的图像，采用ResNet50、ResNet101和ResNet152架构进行训练。接下来的三个分类器使用来自EMBED、数字乳腺钼靶筛查数据库（DDSM）的精选乳腺影像子集（CBIS-DDSM）和DDSM数据集的图像进行训练。所有六个模型仅在来自EMBED的数字乳腺钼靶上进行测试。

结果

结果表明，当EMBED数据集增加CBIS-DDSM/DDSM时，定制的ResNet模型的性能总体下降具有统计学意义。虽然在所有种族亚组中都观察到了性能下降，但与其他种族相比，某些种族的性能下降更为严重。

讨论

性能下降可能潜在地归因于（1）基于胶片的和数字乳腺钼靶之间的特征不匹配（2）病理和放射学信息的不匹配。总之，在训练过程中同时使用胶片和数字乳腺钼靶可能会阻碍为乳腺癌筛查设计的现代模型。在结合基于胶片的和数字乳腺钼靶或同时利用病理和放射学信息时需要谨慎。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f9c0/10426498/c9aff5069a19/fradi-03-1181190-g001.jpg

相似文献

Impact of multi-source data augmentation on performance of convolutional neural networks for abnormality classification in mammography.多源数据增强对用于乳腺钼靶异常分类的卷积神经网络性能的影响。

Front Radiol. 2023 Jun 16;3:1181190. doi: 10.3389/fradi.2023.1181190. eCollection 2023.

Generating Full-Field Digital Mammogram From Digitized Screen-Film Mammogram for Breast Cancer Screening With High-Resolution Generative Adversarial Network.使用高分辨率生成对抗网络从数字化屏-片乳腺造影片生成全场数字化乳腺造影片用于乳腺癌筛查

Front Oncol. 2022 Apr 29;12:868257. doi: 10.3389/fonc.2022.868257. eCollection 2022.

Comparison of segmentation-free and segmentation-dependent computer-aided diagnosis of breast masses on a public mammography dataset.在一个公共乳腺X线摄影数据集上对乳腺肿块的无分割和基于分割的计算机辅助诊断进行比较。

J Biomed Inform. 2021 Jan;113:103656. doi: 10.1016/j.jbi.2020.103656. Epub 2020 Dec 11.

Deep Learning to Improve Breast Cancer Detection on Screening Mammography.深度学习在提高筛查性乳房 X 光摄影乳腺癌检测中的应用。

Sci Rep. 2019 Aug 29;9(1):12495. doi: 10.1038/s41598-019-48995-4.

A framework for breast cancer classification using Multi-DCNNs.基于多 DCNN 的乳腺癌分类框架。

Comput Biol Med. 2021 Apr;131:104245. doi: 10.1016/j.compbiomed.2021.104245. Epub 2021 Jan 29.

Evaluation of data augmentation via synthetic images for improved breast mass detection on mammograms using deep learning.通过合成图像进行数据增强以利用深度学习改进乳腺钼靶片上乳腺肿块检测的评估

J Med Imaging (Bellingham). 2020 Jan;7(1):012703. doi: 10.1117/1.JMI.7.1.012703. Epub 2019 Nov 22.

Automatic mass detection in mammograms using deep convolutional neural networks.使用深度卷积神经网络在乳腺X光片中进行自动肿块检测。

J Med Imaging (Bellingham). 2019 Jul;6(3):031409. doi: 10.1117/1.JMI.6.3.031409. Epub 2019 Feb 20.

Mammography Datasets for Neural Networks-Survey.用于神经网络的乳腺X线摄影数据集——综述

J Imaging. 2023 May 10;9(5):95. doi: 10.3390/jimaging9050095.

A YOLO-based AI system for classifying calcifications on spot magnification mammograms.基于 YOLO 的 AI 系统用于分类乳腺局部放大摄影中的钙化。

Biomed Eng Online. 2023 May 27;22(1):54. doi: 10.1186/s12938-023-01115-w.

A divide and conquer approach to maximise deep learning mammography classification accuracies.一种分而治之的方法，可最大限度地提高深度学习在乳腺 X 光摄影分类中的准确率。

PLoS One. 2023 May 26;18(5):e0280841. doi: 10.1371/journal.pone.0280841. eCollection 2023.

本文引用的文献

The EMory BrEast imaging Dataset (EMBED): A Racially Diverse, Granular Dataset of 3.4 Million Screening and Diagnostic Mammographic Images.埃默里乳腺成像数据集（EMBED）：一个包含340万张筛查和诊断性乳腺钼靶图像的种族多样化、详细的数据集。

Radiol Artif Intell. 2023 Jan 4;5(1):e220047. doi: 10.1148/ryai.220047. eCollection 2023 Jan.

Breast Cancer Mammograms Classification Using Deep Neural Network and Entropy-Controlled Whale Optimization Algorithm.基于深度神经网络和熵控制鲸鱼优化算法的乳腺癌乳房X光片分类

Diagnostics (Basel). 2022 Feb 21;12(2):557. doi: 10.3390/diagnostics12020557.

A Novel Multistage Transfer Learning for Ultrasound Breast Cancer Image Classification.一种用于超声乳腺癌图像分类的新型多阶段迁移学习

Diagnostics (Basel). 2022 Jan 6;12(1):135. doi: 10.3390/diagnostics12010135.

Cancer statistics, 2022.癌症统计数据，2022 年。

CA Cancer J Clin. 2022 Jan;72(1):7-33. doi: 10.3322/caac.21708. Epub 2022 Jan 12.

Uncertainty quantification in skin cancer classification using three-way decision-based Bayesian deep learning.基于三向决策贝叶斯深度学习的皮肤癌分类中的不确定性量化。

Comput Biol Med. 2021 Aug;135:104418. doi: 10.1016/j.compbiomed.2021.104418. Epub 2021 Apr 28.

Deep learning-based auto-segmentation of organs at risk in high-dose rate brachytherapy of cervical cancer.基于深度学习的宫颈癌高剂量率近距离放疗中危及器官的自动分割。

Radiother Oncol. 2021 Jun;159:231-240. doi: 10.1016/j.radonc.2021.03.030. Epub 2021 Apr 6.

Robust breast cancer detection in mammography and digital breast tomosynthesis using an annotation-efficient deep learning approach.利用高效标注的深度学习方法在乳腺 X 线摄影和数字乳腺断层合成术中进行稳健的乳腺癌检测。

Nat Med. 2021 Feb;27(2):244-249. doi: 10.1038/s41591-020-01174-9. Epub 2021 Jan 11.

Fighting against COVID-19: A novel deep learning model based on YOLO-v2 with ResNet-50 for medical face mask detection.抗击新冠疫情：一种基于带有ResNet-50的YOLO-v2的新型深度学习模型用于医用口罩检测

Sustain Cities Soc. 2021 Feb;65:102600. doi: 10.1016/j.scs.2020.102600. Epub 2020 Nov 12.

A Novel Medical Diagnosis model for COVID-19 infection detection based on Deep Features and Bayesian Optimization.一种基于深度特征和贝叶斯优化的新型新冠病毒感染检测医学诊断模型。

Appl Soft Comput. 2020 Dec;97:106580. doi: 10.1016/j.asoc.2020.106580. Epub 2020 Jul 28.

Breast Cancer Histopathology Image Classification Using an Ensemble of Deep Learning Models.基于深度学习模型集成的乳腺癌病理图像分类。

Sensors (Basel). 2020 Aug 5;20(16):4373. doi: 10.3390/s20164373.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

多源数据增强对用于乳腺钼靶异常分类的卷积神经网络性能的影响。

Impact of multi-source data augmentation on performance of convolutional neural networks for abnormality classification in mammography.

作者信息

机构信息

出版信息

INTRODUCTION

METHODS

RESULTS

DISCUSSION

引言

方法

结果

讨论

相似文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

本文引用的文献