• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

对有限肺炎X光数据上用于合成数据增强的生成模型的批判性评估。

A Critical Assessment of Generative Models for Synthetic Data Augmentation on Limited Pneumonia X-ray Data.

作者信息

Schaudt Daniel, Späte Christian, von Schwerin Reinhold, Reichert Manfred, von Schwerin Marianne, Beer Meinrad, Kloth Christopher

机构信息

Institute of Databases and Information Systems, Ulm University, James-Franck-Ring, 89081 Ulm, Germany.

DASU Transferzentrum für Digitalisierung, Analytics und Data Science Ulm, Olgastraße 94, 89073 Ulm, Germany.

出版信息

Bioengineering (Basel). 2023 Dec 14;10(12):1421. doi: 10.3390/bioengineering10121421.

DOI:10.3390/bioengineering10121421
PMID:38136012
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10741143/
Abstract

In medical imaging, deep learning models serve as invaluable tools for expediting diagnoses and aiding specialized medical professionals in making clinical decisions. However, effectively training deep learning models typically necessitates substantial quantities of high-quality data, a resource often lacking in numerous medical imaging scenarios. One way to overcome this deficiency is to artificially generate such images. Therefore, in this comparative study we train five generative models to artificially increase the amount of available data in such a scenario. This synthetic data approach is evaluated on a a downstream classification task, predicting four causes for pneumonia as well as healthy cases on 1082 chest X-ray images. Quantitative and medical assessments show that a Generative Adversarial Network (GAN)-based approach significantly outperforms more recent diffusion-based approaches on this limited dataset with better image quality and pathological plausibility. We show that better image quality surprisingly does not translate to improved classification performance by evaluating five different classification models and varying the amount of additional training data. Class-specific metrics like precision, recall, and F1-score show a substantial improvement by using synthetic images, emphasizing the data rebalancing effect of less frequent classes. However, overall performance does not improve for most models and configurations, except for a DreamBooth approach which shows a +0.52 improvement in overall accuracy. The large variance of performance impact in this study suggests a careful consideration of utilizing generative models for limited data scenarios, especially with an unexpected negative correlation between image quality and downstream classification improvement.

摘要

在医学成像中,深度学习模型是加快诊断速度并协助专业医疗人员做出临床决策的宝贵工具。然而,有效训练深度学习模型通常需要大量高质量数据,而在众多医学成像场景中,这种资源往往匮乏。克服这一不足的一种方法是人工生成此类图像。因此,在这项对比研究中,我们训练了五个生成模型,以便在这种场景下人工增加可用数据量。这种合成数据方法在一个下游分类任务中进行评估,该任务是在1082张胸部X光图像上预测肺炎的四种病因以及健康病例。定量和医学评估表明,在这个有限的数据集上,基于生成对抗网络(GAN)的方法明显优于最近基于扩散的方法,生成的图像质量更高,病理合理性更强。通过评估五种不同的分类模型并改变额外训练数据的数量,我们发现令人惊讶的是,更好的图像质量并没有转化为更高的分类性能。像精确率、召回率和F1分数这样的特定类别指标显示,使用合成图像有显著改善,这突出了较少出现的类别的数据重新平衡效果。然而,除了一种DreamBooth方法在整体准确率上提高了0.52之外,大多数模型和配置的整体性能并没有提高。本研究中性能影响的巨大差异表明,在有限数据场景中使用生成模型时需要谨慎考虑,特别是在图像质量与下游分类改进之间存在意外负相关的情况下。

相似文献

1
A Critical Assessment of Generative Models for Synthetic Data Augmentation on Limited Pneumonia X-ray Data.对有限肺炎X光数据上用于合成数据增强的生成模型的批判性评估。
Bioengineering (Basel). 2023 Dec 14;10(12):1421. doi: 10.3390/bioengineering10121421.
2
Data augmentation using Generative Adversarial Networks (GANs) for GAN-based detection of Pneumonia and COVID-19 in chest X-ray images.使用生成对抗网络(GAN)进行数据增强,用于基于GAN的胸部X光图像中肺炎和新冠肺炎的检测。
Inform Med Unlocked. 2021;27:100779. doi: 10.1016/j.imu.2021.100779. Epub 2021 Nov 22.
3
Skin Lesion Synthesis and Classification Using an Improved DCGAN Classifier.使用改进的深度卷积生成对抗网络分类器进行皮肤病变合成与分类
Diagnostics (Basel). 2023 Aug 9;13(16):2635. doi: 10.3390/diagnostics13162635.
4
A GAN-based image synthesis method for skin lesion classification.一种基于生成对抗网络的用于皮肤病变分类的图像合成方法。
Comput Methods Programs Biomed. 2020 Oct;195:105568. doi: 10.1016/j.cmpb.2020.105568. Epub 2020 May 29.
5
Generative adversarial network based synthetic data training model for lightweight convolutional neural networks.用于轻量级卷积神经网络的基于生成对抗网络的合成数据训练模型。
Multimed Tools Appl. 2023 May 20:1-23. doi: 10.1007/s11042-023-15747-6.
6
GAN Inversion for Data Augmentation to Improve Colonoscopy Lesion Classification.用于数据增强以改善结肠镜检查病变分类的生成对抗网络反演
IEEE J Biomed Health Inform. 2025 Jun;29(6):3864-3873. doi: 10.1109/JBHI.2024.3397611.
7
Generative adversarial network based data augmentation for CNN based detection of Covid-19.基于生成对抗网络的数据增强在基于 CNN 的新冠病毒检测中的应用。
Sci Rep. 2022 Nov 10;12(1):19186. doi: 10.1038/s41598-022-23692-x.
8
GSDA: Generative adversarial network-based semi-supervised data augmentation for ultrasound image classification.GSDA:基于生成对抗网络的半监督数据增强用于超声图像分类
Heliyon. 2023 Sep 4;9(9):e19585. doi: 10.1016/j.heliyon.2023.e19585. eCollection 2023 Sep.
9
Multi-domain medical image translation generation for lung image classification based on generative adversarial networks.基于生成对抗网络的肺部图像分类的多领域医学图像翻译生成。
Comput Methods Programs Biomed. 2023 Feb;229:107200. doi: 10.1016/j.cmpb.2022.107200. Epub 2022 Nov 2.
10
Breast cancer detection using synthetic mammograms from generative adversarial networks in convolutional neural networks.利用卷积神经网络中生成对抗网络的合成乳房X光片进行乳腺癌检测。
J Med Imaging (Bellingham). 2019 Jul;6(3):031411. doi: 10.1117/1.JMI.6.3.031411. Epub 2019 Mar 23.

引用本文的文献

1
Scorecard for synthetic medical data evaluation.合成医学数据评估记分卡。
Commun Eng. 2025 Jul 21;4(1):130. doi: 10.1038/s44172-025-00450-1.
2
FLPneXAINet: Federated deep learning and explainable AI for improved pneumonia prediction utilizing GAN-augmented chest X-ray data.FLPneXAINet:用于利用GAN增强胸部X光数据改进肺炎预测的联邦深度学习与可解释人工智能。
PLoS One. 2025 Jul 17;20(7):e0324957. doi: 10.1371/journal.pone.0324957. eCollection 2025.
3
Synergistic pairing of synthetic image generation with disease classification modeling permits rapid digital classification tool development.

本文引用的文献

1
Augmentation strategies for an imbalanced learning problem on a novel COVID-19 severity dataset.针对新型 COVID-19 严重程度数据集的不平衡学习问题的增强策略。
Sci Rep. 2023 Oct 25;13(1):18299. doi: 10.1038/s41598-023-45532-2.
2
Application of clinical and CT imaging features in the evaluation of disease progression in patients with COVID-19.临床和 CT 影像特征在评估 COVID-19 患者疾病进展中的应用。
BMC Pulm Med. 2023 Sep 6;23(1):329. doi: 10.1186/s12890-023-02613-2.
3
A multimodal comparison of latent denoising diffusion probabilistic models and generative adversarial networks for medical image synthesis.
合成图像生成与疾病分类建模的协同配对可实现快速的数字分类工具开发。
Sci Rep. 2024 Oct 27;14(1):25632. doi: 10.1038/s41598-024-77565-6.
基于潜在去噪扩散概率模型和生成对抗网络的医学图像合成的多模态比较。
Sci Rep. 2023 Jul 26;13(1):12098. doi: 10.1038/s41598-023-39278-0.
4
Unsupervised Medical Image Translation With Adversarial Diffusion Models.基于对抗扩散模型的无监督医学图像翻译。
IEEE Trans Med Imaging. 2023 Dec;42(12):3524-3539. doi: 10.1109/TMI.2023.3290149. Epub 2023 Nov 30.
5
ResNetFed: Federated Deep Learning Architecture for Privacy-Preserving Pneumonia Detection from COVID-19 Chest Radiographs.ResNetFed:用于从新冠胸部X光片中进行隐私保护肺炎检测的联邦深度学习架构。
J Healthc Inform Res. 2023 Jun 14;7(2):203-224. doi: 10.1007/s41666-023-00132-7. eCollection 2023 Jun.
6
Leveraging human expert image annotations to improve pneumonia differentiation through human knowledge distillation.利用人类专家的图像标注来通过人类知识蒸馏提高肺炎鉴别能力。
Sci Rep. 2023 Jun 6;13(1):9203. doi: 10.1038/s41598-023-36148-7.
7
An Effective Image-Based Tomato Leaf Disease Segmentation Method Using MC-UNet.一种基于图像的使用MC-UNet的番茄叶病分割有效方法。
Plant Phenomics. 2023 May 15;5:0049. doi: 10.34133/plantphenomics.0049. eCollection 2023.
8
Denoising diffusion probabilistic models for 3D medical image generation.基于去噪扩散概率模型的三维医学图像生成。
Sci Rep. 2023 May 5;13(1):7303. doi: 10.1038/s41598-023-34341-2.
9
Deep Learning Approaches for Data Augmentation in Medical Imaging: A Review.医学成像中用于数据增强的深度学习方法:综述
J Imaging. 2023 Apr 13;9(4):81. doi: 10.3390/jimaging9040081.
10
Diffusion Models in Vision: A Survey.视觉中的扩散模型:综述
IEEE Trans Pattern Anal Mach Intell. 2023 Sep;45(9):10850-10869. doi: 10.1109/TPAMI.2023.3261988. Epub 2023 Aug 7.