• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于深度学习的图像数据增强技术:一项综述。

Image data augmentation techniques based on deep learning: A survey.

作者信息

Zeng Wu

机构信息

Engineering Training Center, Putian University, Putian 351100, China.

出版信息

Math Biosci Eng. 2024 Jun 12;21(6):6190-6224. doi: 10.3934/mbe.2024272.

DOI:10.3934/mbe.2024272
PMID:39176424
Abstract

In recent years, deep learning (DL) techniques have achieved remarkable success in various fields of computer vision. This progress was attributed to the vast amounts of data utilized to train these models, as they facilitated the learning of more intricate and detailed feature information about target objects, leading to improved model performance. However, in most real-world tasks, it was challenging to gather sufficient data for model training. Insufficient datasets often resulted in models prone to overfitting. To address this issue and enhance model performance, generalization ability, and mitigate overfitting in data-limited scenarios, image data augmentation methods have been proposed. These methods generated synthetic samples to augment the original dataset, emerging as a preferred strategy to boost model performance when data was scarce. This review first introduced commonly used and highly effective image data augmentation techniques, along with a detailed analysis of their advantages and disadvantages. Second, this review presented several datasets frequently employed for evaluating the performance of image data augmentation methods and examined how advanced augmentation techniques can enhance model performance. Third, this review discussed the applications and performance of data augmentation techniques in various computer vision domains. Finally, this review provided an outlook on potential future research directions for image data augmentation methods.

摘要

近年来,深度学习(DL)技术在计算机视觉的各个领域都取得了显著成功。这一进展归因于用于训练这些模型的大量数据,因为它们有助于学习有关目标对象的更复杂、更详细的特征信息,从而提高模型性能。然而,在大多数实际任务中,为模型训练收集足够的数据具有挑战性。数据集不足往往导致模型容易出现过拟合。为了解决这个问题,提高模型性能、泛化能力,并减轻数据受限场景中的过拟合,人们提出了图像数据增强方法。这些方法生成合成样本以扩充原始数据集,成为在数据稀缺时提高模型性能的首选策略。本综述首先介绍了常用且高效的图像数据增强技术,并详细分析了它们的优缺点。其次,本综述介绍了几个常用于评估图像数据增强方法性能的数据集,并研究了先进的增强技术如何提高模型性能。第三,本综述讨论了数据增强技术在各个计算机视觉领域的应用和性能。最后,本综述对图像数据增强方法未来潜在的研究方向进行了展望。

相似文献

1
Image data augmentation techniques based on deep learning: A survey.基于深度学习的图像数据增强技术:一项综述。
Math Biosci Eng. 2024 Jun 12;21(6):6190-6224. doi: 10.3934/mbe.2024272.
2
A medical image classification method based on self-regularized adversarial learning.基于自正则化对抗学习的医学图像分类方法。
Med Phys. 2024 Nov;51(11):8232-8246. doi: 10.1002/mp.17320. Epub 2024 Jul 30.
3
Data Augmentation Techniques for Machine Learning Applied to Optical Spectroscopy Datasets in Agrifood Applications: A Comprehensive Review.用于农业食品应用中光谱数据集的机器学习数据增强技术:全面综述
Sensors (Basel). 2023 Oct 18;23(20):8562. doi: 10.3390/s23208562.
4
Generative adversarial network based synthetic data training model for lightweight convolutional neural networks.用于轻量级卷积神经网络的基于生成对抗网络的合成数据训练模型。
Multimed Tools Appl. 2023 May 20:1-23. doi: 10.1007/s11042-023-15747-6.
5
Data augmentation using generative adversarial networks (CycleGAN) to improve generalizability in CT segmentation tasks.使用生成对抗网络(CycleGAN)进行数据增强以提高 CT 分割任务的泛化能力。
Sci Rep. 2019 Nov 15;9(1):16884. doi: 10.1038/s41598-019-52737-x.
6
2S-BUSGAN: A Novel Generative Adversarial Network for Realistic Breast Ultrasound Image with Corresponding Tumor Contour Based on Small Datasets.2S-BUSGAN:一种基于小数据集的具有真实乳房超声图像和对应肿瘤轮廓的新型生成对抗网络。
Sensors (Basel). 2023 Oct 20;23(20):8614. doi: 10.3390/s23208614.
7
A survey on generative adversarial networks for imbalance problems in computer vision tasks.关于计算机视觉任务中不平衡问题的生成对抗网络调查。
J Big Data. 2021;8(1):27. doi: 10.1186/s40537-021-00414-0. Epub 2021 Jan 29.
8
Data augmentation for enhancing EEG-based emotion recognition with deep generative models.基于深度生成模型的数据增强以增强基于 EEG 的情绪识别。
J Neural Eng. 2020 Oct 14;17(5):056021. doi: 10.1088/1741-2552/abb580.
9
A comprehensive survey of recent trends in deep learning for digital images augmentation.对数字图像增强的深度学习近期趋势的全面调查。
Artif Intell Rev. 2022;55(3):2351-2377. doi: 10.1007/s10462-021-10066-4. Epub 2021 Sep 4.
10
Examining the effect of synthetic data augmentation in polyp detection and segmentation.检测合成数据增强在息肉检测和分割中的效果。
Int J Comput Assist Radiol Surg. 2022 Jul;17(7):1289-1302. doi: 10.1007/s11548-022-02651-x. Epub 2022 Jun 9.

引用本文的文献

1
An optimized deep learning model based on transperineal ultrasound images for precision diagnosis of female stress urinary incontinence.一种基于经会阴超声图像的优化深度学习模型,用于女性压力性尿失禁的精准诊断。
Front Med (Lausanne). 2025 Apr 28;12:1564446. doi: 10.3389/fmed.2025.1564446. eCollection 2025.
2
A Static Sign Language Recognition Method Enhanced with Self-Attention Mechanisms.基于自注意力机制的静态手语识别方法。
Sensors (Basel). 2024 Oct 29;24(21):6921. doi: 10.3390/s24216921.