Master of Science in Data Science, University of San Francisco, San Francisco, CA, 94105, USA.
Department of Radiation Oncology, University of California San Francisco, San Francisco, CA, 94158, USA.
Med Phys. 2020 Dec;47(12):6246-6256. doi: 10.1002/mp.14507. Epub 2020 Oct 25.
PURPOSE: To perform an in-depth evaluation of current state-of-the-art techniques for training neural networks, in order to identify appropriate approaches for small datasets. METHOD: In total, 112,120 frontal-view X-ray images from the NIH ChestXray14 dataset were used in our analysis. Two tasks were studied: unbalanced multi-label classification of 14 diseases, and binary classification of pneumonia vs non-pneumonia. All datasets were randomly split into training, validation, and testing (70%, 10%, and 20%). Two popular convolutional neural networks (CNNs), DenseNet121 and ResNet50, were trained using PyTorch. We performed several experiments to test: (a) whether transfer learning using networks pretrained on ImageNet is of value to medical imaging/physics tasks (e.g., predicting toxicity from radiographic images after training on images from the internet), (b) whether using networks pretrained on problems similar to the target task helps transfer learning (e.g., using X-ray pretrained networks for X-ray target tasks), (c) whether freezing deep layers or updating all weights provides an optimal transfer learning strategy, (d) the best strategy for the learning rate policy, and (e) what quantity of data is needed to appropriately deploy these various strategies (N = 50 to N = 77 880). RESULTS: In the multi-label problem, DenseNet121 needed at least 1600 patients to be comparable to, and 10 000 to outperform, radiomics-based logistic regression. In classifying pneumonia vs non-pneumonia, both CNN and radiomics-based methods performed poorly when N < 2000. For small datasets (N < 2000), however, a significant boost in performance (>15% increase in AUC) comes from a good selection of the transfer learning dataset, dropout, a cyclical learning rate, and freezing and unfreezing of deep layers as training progresses. In contrast, if sufficient data are available (N > 35 000), little or no tweaking is needed to obtain impressive performance. 
While transfer learning using X-ray images from other anatomical sites improves performance, we observed a similar boost from networks pretrained on ImageNet. Using source images from the same anatomical site, however, outperforms every other methodology by up to 15%. In this case, DL models can be trained with as few as N = 50. CONCLUSIONS: While training DL models on small datasets (N < 2000) is challenging, no tweaking is necessary for bigger datasets (N > 35 000). Transfer learning with images from the same anatomical site can yield remarkable performance on new tasks with as few as N = 50. Surprisingly, we did not find any advantage in using images from other anatomical sites over networks trained on ImageNet. This indicates that the learned features may not be as general as currently believed, and performance decays rapidly even when only the anatomical site of the images is changed.
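The freeze-then-unfreeze transfer learning schedule with a cyclical learning rate described above can be sketched in PyTorch. This is a minimal illustration, not the paper's implementation: a toy two-layer backbone stands in for the pretrained DenseNet121/ResNet50, and all layer sizes, learning rates, and step counts are illustrative assumptions.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# Toy stand-in for a pretrained backbone (the paper used DenseNet121/ResNet50
# initialized from ImageNet or X-ray weights; sizes here are illustrative).
backbone = nn.Sequential(
    nn.Conv2d(1, 8, 3, padding=1), nn.ReLU(),
    nn.Conv2d(8, 16, 3, padding=1), nn.ReLU(),
    nn.AdaptiveAvgPool2d(1), nn.Flatten(),
)
head = nn.Linear(16, 14)  # new head for the 14-disease multi-label task
model = nn.Sequential(backbone, head)

# Stage 1: freeze the deep (backbone) layers and train only the new head,
# cycling the learning rate between base_lr and max_lr.
for p in backbone.parameters():
    p.requires_grad = False

opt = torch.optim.SGD(
    (p for p in model.parameters() if p.requires_grad),
    lr=1e-3, momentum=0.9)
sched = torch.optim.lr_scheduler.CyclicLR(
    opt, base_lr=1e-4, max_lr=1e-2, step_size_up=4)

x = torch.randn(2, 1, 32, 32)   # dummy batch of single-channel images
y = torch.rand(2, 14)           # dummy multi-label targets in [0, 1]
loss_fn = nn.BCEWithLogitsLoss()

for _ in range(8):              # a few head-only warm-up steps
    opt.zero_grad()
    loss_fn(model(x), y).backward()
    opt.step()
    sched.step()

# Stage 2: unfreeze the backbone as training progresses and fine-tune
# end-to-end at a lower learning rate.
for p in backbone.parameters():
    p.requires_grad = True
opt.add_param_group({"params": backbone.parameters(), "lr": 1e-4})
```

In a realistic setup the backbone would load pretrained weights (e.g. `torchvision.models.densenet121`) and the unfreezing would happen gradually, block by block, once the head's loss plateaus.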