训练数据量对卷积神经网络气胸分类器性能的影响。

Effect of Training Data Volume on Performance of Convolutional Neural Network Pneumothorax Classifiers.

机构信息

Department of Diagnostic Imaging, National University Hospital, 5 Lower Kent Ridge Rd, Queenstown, 119074, Singapore.

Saw Swee Hock School of Public Health, School of Computer Science, Yong Loo Lin School of Medicine, National University of Singapore, 12 Science Drive 2, #10-01, Queenstown, 117549, Singapore.

出版信息

J Digit Imaging. 2022 Aug;35(4):881-892. doi: 10.1007/s10278-022-00594-y. Epub 2022 Mar 3.

DOI:10.1007/s10278-022-00594-y

PMID:35239091

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9485337/

Abstract

Large datasets with high-quality labels required to train deep neural networks are challenging to obtain in the radiology domain. This work investigates the effect of training dataset size on the performance of deep learning classifiers, focusing on chest radiograph pneumothorax detection as a proxy visual task in the radiology domain. Two open-source datasets (ChestX-ray14 and CheXpert) comprising 291,454 images were merged and convolutional neural networks trained with stepwise increase in training dataset sizes. Model iterations at each dataset volume were evaluated on an external test set of 525 emergency department chest radiographs. Learning curve analysis was performed to fit the observed AUCs for all models generated. For all three network architectures tested, model AUCs and accuracy increased rapidly from 2 × 10 to 20 × 10 training samples, with more gradual increase until the maximum training dataset size of 291 × 10 images. AUCs for models trained with the maximum tested dataset size of 291 × 10 images were significantly higher than models trained with 20 × 10 images: ResNet-50: AUC = 0.86, AUC = 0.95, p < 0.001; DenseNet-121 AUC = 0.85, AUC = 0.93, p < 0.001; EfficientNet AUC = 0.92, AUC = 0.98, p < 0.001. Our study established learning curves describing the relationship between dataset training size and model performance of deep learning convolutional neural networks applied to a typical radiology binary classification task. These curves suggest a point of diminishing performance returns for increasing training data volumes, which algorithm developers should consider given the high costs of obtaining and labelling radiology data.

摘要

大型、高质量标注的数据集对于训练深度神经网络来说具有挑战性，在放射学领域尤其如此。本研究旨在探讨训练数据集大小对深度学习分类器性能的影响，以胸部 X 光片气胸检测作为放射学领域的代表性视觉任务。我们合并了两个开源数据集（ChestX-ray14 和 CheXpert），共包含 291,454 张图像，并使用逐步增加训练数据集大小的方法训练卷积神经网络。在一个包含 525 张急诊科胸部 X 光片的外部测试集中评估了每个数据集容量的模型迭代。我们进行了学习曲线分析，以拟合所有生成模型的观测 AUC。对于测试的三种网络架构，模型 AUC 和准确率在从 2×10 到 20×10 个训练样本时快速增加，然后在达到 291×10 个图像的最大训练数据集大小时逐渐增加。使用最大测试数据集大小（291×10 个图像）训练的模型的 AUC 明显高于使用 20×10 个图像训练的模型：ResNet-50：AUC=0.86，AUC=0.95，p<0.001；DenseNet-121 AUC=0.85，AUC=0.93，p<0.001；EfficientNet AUC=0.92，AUC=0.98，p<0.001。本研究建立了描述深度学习卷积神经网络应用于典型放射学二分类任务时，数据集训练大小与模型性能之间关系的学习曲线。这些曲线表明，随着训练数据量的增加，性能回报会逐渐减少，算法开发人员应该考虑到获取和标注放射学数据的高成本。

相似文献

Effect of Training Data Volume on Performance of Convolutional Neural Network Pneumothorax Classifiers.训练数据量对卷积神经网络气胸分类器性能的影响。

J Digit Imaging. 2022 Aug;35(4):881-892. doi: 10.1007/s10278-022-00594-y. Epub 2022 Mar 3.

Detection of Pneumothorax with Deep Learning Models: Learning From Radiologist Labels vs Natural Language Processing Model Generated Labels.深度学习模型检测气胸：从放射科医生标签与自然语言处理模型生成标签中学习。

Acad Radiol. 2022 Sep;29(9):1350-1358. doi: 10.1016/j.acra.2021.09.013. Epub 2021 Oct 12.

Automated detection of moderate and large pneumothorax on frontal chest X-rays using deep convolutional neural networks: A retrospective study.使用深度卷积神经网络自动检测正位胸部 X 光片中的中至大量气胸：一项回顾性研究。

PLoS Med. 2018 Nov 20;15(11):e1002697. doi: 10.1371/journal.pmed.1002697. eCollection 2018 Nov.

CheXLocNet: Automatic localization of pneumothorax in chest radiographs using deep convolutional neural networks.CheXLocNet：使用深度卷积神经网络自动定位胸部 X 光片中的气胸。

PLoS One. 2020 Nov 9;15(11):e0242013. doi: 10.1371/journal.pone.0242013. eCollection 2020.

German CheXpert Chest X-ray Radiology Report Labeler.德国 CheXpert 胸部 X 射线放射学报告标签生成器。

Rofo. 2024 Sep;196(9):956-965. doi: 10.1055/a-2234-8268. Epub 2024 Jan 31.

Can AI outperform a junior resident? Comparison of deep neural network to first-year radiology residents for identification of pneumothorax.人工智能能超越初级住院医师吗？深度学习神经网络与第一年放射科住院医师在气胸识别方面的比较。

Emerg Radiol. 2020 Aug;27(4):367-375. doi: 10.1007/s10140-020-01767-4. Epub 2020 Jul 8.

Deep learning prediction of sex on chest radiographs: a potential contributor to biased algorithms.深度学习预测胸部 X 光片上的性别：导致算法产生偏差的潜在因素。

Emerg Radiol. 2022 Apr;29(2):365-370. doi: 10.1007/s10140-022-02019-3. Epub 2022 Jan 10.

Deep Learning Method for Automated Classification of Anteroposterior and Posteroanterior Chest Radiographs.深度学习方法在前后位和后前位胸部 X 线片中的自动分类。

J Digit Imaging. 2019 Dec;32(6):925-930. doi: 10.1007/s10278-019-00208-0.

Deep multi-instance transfer learning for pneumothorax classification in chest X-ray images.基于深度多实例转移学习的胸片气胸分类。

Med Phys. 2022 Jan;49(1):231-243. doi: 10.1002/mp.15328. Epub 2021 Dec 7.

Radiology "forensics": determination of age and sex from chest radiographs using deep learning.放射学“法医学”：使用深度学习从胸部 X 光片中确定年龄和性别。

Emerg Radiol. 2021 Oct;28(5):949-954. doi: 10.1007/s10140-021-01953-y. Epub 2021 Jun 5.

引用本文的文献

In vivo variability of MRI radiomics features in prostate lesions assessed by a test-retest study with repositioning.通过重新定位的重测研究评估前列腺病变中MRI影像组学特征的体内变异性。

Sci Rep. 2025 Aug 13;15(1):29703. doi: 10.1038/s41598-025-09989-7.

Dosing prediction of valproic acid in pediatric patients with epilepsy: population pharmacokinetic model or machine learning model?癫痫患儿丙戊酸的剂量预测：群体药代动力学模型还是机器学习模型？

Eur J Clin Pharmacol. 2025 Jul 5. doi: 10.1007/s00228-025-03874-y.

A low-cost platform for automated cervical cytology: addressing health and socioeconomic challenges in low-resource settings.一种用于自动宫颈细胞学检查的低成本平台：应对资源匮乏地区的健康和社会经济挑战。

Front Med Technol. 2025 Mar 31;7:1531817. doi: 10.3389/fmedt.2025.1531817. eCollection 2025.

Detection of C-shaped mandibular second molars on panoramic radiographs using deep convolutional neural networks.使用深度卷积神经网络在全景片上检测 C 形下颌第二磨牙。

Clin Oral Investig. 2024 Nov 18;28(12):646. doi: 10.1007/s00784-024-06049-8.

Comparison of three artificial intelligence algorithms for automatic cobb angle measurement using teaching data specific to three disease groups.使用针对三个疾病组的特定教学数据比较三种人工智能算法在自动 Cobb 角测量中的应用。

Sci Rep. 2024 Aug 3;14(1):17989. doi: 10.1038/s41598-024-68937-z.

Radiographic chest wall abnormalities in primary spontaneous pneumothorax identified by artificial intelligence.人工智能识别出的原发性自发性气胸的胸部影像学胸壁异常

Heliyon. 2024 Apr 30;10(9):e30023. doi: 10.1016/j.heliyon.2024.e30023. eCollection 2024 May 15.

Automated segmentation and volume prediction in pediatric Wilms' tumor CT using nnu-net.使用 nnu-net 进行小儿肾母细胞瘤 CT 的自动分割和体积预测。

BMC Pediatr. 2024 May 9;24(1):321. doi: 10.1186/s12887-024-04775-2.

Privacy, Please: Safeguarding Medical Data in Imaging AI Using Differential Privacy Techniques.请保护隐私：使用差分隐私技术保护医学影像人工智能中的数据安全。

Radiol Artif Intell. 2024 Jan;6(1):e230560. doi: 10.1148/ryai.230560.

Innovative advances in pediatric radiology: computed tomography reconstruction techniques, photon-counting detector computed tomography, and beyond.儿科放射学的创新进展：计算机断层扫描重建技术、光子计数探测器计算机断层扫描，以及更多。

Pediatr Radiol. 2024 Jan;54(1):1-11. doi: 10.1007/s00247-023-05823-2. Epub 2023 Dec 2.

Deep learning for pneumothorax diagnosis: a systematic review and meta-analysis.深度学习在气胸诊断中的应用：系统评价和荟萃分析。

Eur Respir Rev. 2023 Jun 7;32(168). doi: 10.1183/16000617.0259-2022. Print 2023 Jun 30.

本文引用的文献

Radiol Artif Intell. 2019 Jan 30;1(1):e180031. doi: 10.1148/ryai.2019180031. eCollection 2019 Jan.

Preparing Medical Imaging Data for Machine Learning.医学影像数据的机器学习准备

Radiology. 2020 Apr;295(1):4-15. doi: 10.1148/radiol.2020192224. Epub 2020 Feb 18.

Chest Radiograph Interpretation with Deep Learning Models: Assessment with Radiologist-adjudicated Reference Standards and Population-adjusted Evaluation.深度学习模型在胸部 X 线片解读中的应用：使用经过放射科医师裁定的参考标准和人群校正评估进行评估。

Radiology. 2020 Feb;294(2):421-431. doi: 10.1148/radiol.2019191293. Epub 2019 Dec 3.

Exploring Large-scale Public Medical Image Datasets.探索大规模公共医学图像数据集。

Acad Radiol. 2020 Jan;27(1):106-112. doi: 10.1016/j.acra.2019.10.006. Epub 2019 Nov 6.

Key challenges for delivering clinical impact with artificial intelligence.人工智能实现临床影响的关键挑战。

BMC Med. 2019 Oct 29;17(1):195. doi: 10.1186/s12916-019-1426-2.

Deep-Learning-Based Neural Tissue Segmentation of MRI in Multiple Sclerosis: Effect of Training Set Size.基于深度学习的多发性硬化症磁共振成像神经组织分割：训练集大小的影响

J Magn Reson Imaging. 2020 May;51(5):1487-1496. doi: 10.1002/jmri.26959. Epub 2019 Oct 18.

Sample-Size Determination Methodologies for Machine Learning in Medical Imaging Research: A Systematic Review.机器学习在医学影像学研究中的样本量确定方法：系统评价。

Can Assoc Radiol J. 2019 Nov;70(4):344-353. doi: 10.1016/j.carj.2019.06.002. Epub 2019 Sep 12.

Privacy in the age of medical big data.医疗大数据时代的隐私问题。

Nat Med. 2019 Jan;25(1):37-43. doi: 10.1038/s41591-018-0272-7. Epub 2019 Jan 7.

Assessment of Convolutional Neural Networks for Automated Classification of Chest Radiographs.卷积神经网络在胸部 X 光片自动分类中的评估。

Radiology. 2019 Feb;290(2):537-544. doi: 10.1148/radiol.2018181422. Epub 2018 Nov 13.

Artificial intelligence in fracture detection: transfer learning from deep convolutional neural networks.骨折检测中的人工智能：基于深度卷积神经网络的迁移学习

Clin Radiol. 2018 May;73(5):439-445. doi: 10.1016/j.crad.2017.11.015. Epub 2017 Dec 18.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验