• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

图像分类的自动数据增强研究:学习合成、混合和生成

A Survey of Automated Data Augmentation for Image Classification: Learning to Compose, Mix, and Generate.

作者信息

Cheung Tsz-Him, Yeung Dit-Yan

出版信息

IEEE Trans Neural Netw Learn Syst. 2024 Oct;35(10):13185-13205. doi: 10.1109/TNNLS.2023.3282258. Epub 2024 Oct 7.

DOI:10.1109/TNNLS.2023.3282258
PMID:37342945
Abstract

Data augmentation is an effective way to improve the generalization of deep learning models. However, the underlying augmentation methods mainly rely on handcrafted operations, such as flipping and cropping for image data. These augmentation methods are often designed based on human expertise or repeated trials. Meanwhile, automated data augmentation (AutoDA) is a promising research direction that frames the data augmentation process as a learning task and finds the most effective way to augment the data. In this survey, we categorize recent AutoDA methods into the composition-, mixing-, and generation-based approaches and analyze each category in detail. Based on the analysis, we discuss the challenges and future prospects as well as provide guidelines for applying AutoDA methods by considering the dataset, computation effort, and availability of domain-specific transformations. It is hoped that this article can provide a useful list of AutoDA methods and guidelines for data partitioners when deploying AutoDA in practice. The survey can also serve as a reference for further study by researchers in this emerging research area.

摘要

数据增强是提高深度学习模型泛化能力的有效方法。然而,底层的增强方法主要依赖手工操作,如图像数据的翻转和裁剪。这些增强方法通常基于人类专业知识或反复试验来设计。同时,自动数据增强(AutoDA)是一个很有前景的研究方向,它将数据增强过程构建为一个学习任务,并找到增强数据的最有效方法。在本次综述中,我们将近期的AutoDA方法分为基于组合、混合和生成的方法,并对每一类进行详细分析。基于该分析,我们讨论了挑战和未来前景,并通过考虑数据集、计算量和特定领域变换的可用性,为应用AutoDA方法提供了指导原则。希望本文能为在实践中部署AutoDA时的数据划分人员提供一份有用的AutoDA方法列表和指导原则。本综述也可为该新兴研究领域的研究人员进一步研究提供参考。

相似文献

1
A Survey of Automated Data Augmentation for Image Classification: Learning to Compose, Mix, and Generate.图像分类的自动数据增强研究:学习合成、混合和生成
IEEE Trans Neural Netw Learn Syst. 2024 Oct;35(10):13185-13205. doi: 10.1109/TNNLS.2023.3282258. Epub 2024 Oct 7.
2
Data augmentation for medical imaging: A systematic literature review.医学成像中的数据增强:系统文献回顾。
Comput Biol Med. 2023 Jan;152:106391. doi: 10.1016/j.compbiomed.2022.106391. Epub 2022 Dec 9.
3
Image data augmentation techniques based on deep learning: A survey.基于深度学习的图像数据增强技术:一项综述。
Math Biosci Eng. 2024 Jun 12;21(6):6190-6224. doi: 10.3934/mbe.2024272.
4
Distribution-preserving data augmentation.保持分布的数据增强
PeerJ Comput Sci. 2021 May 27;7:e571. doi: 10.7717/peerj-cs.571. eCollection 2021.
5
Automatic data augmentation to improve generalization of deep learning in H&E stained histopathology.基于 H&E 染色组织病理学的深度学习自动数据增强以提高泛化能力。
Comput Biol Med. 2024 Mar;170:108018. doi: 10.1016/j.compbiomed.2024.108018. Epub 2024 Jan 24.
6
Learning to Compose Domain-Specific Transformations for Data Augmentation.学习合成用于数据增强的特定领域变换。
Adv Neural Inf Process Syst. 2017 Dec;30:3239-3249.
7
A comprehensive survey of recent trends in deep learning for digital images augmentation.对数字图像增强的深度学习近期趋势的全面调查。
Artif Intell Rev. 2022;55(3):2351-2377. doi: 10.1007/s10462-021-10066-4. Epub 2021 Sep 4.
8
Generalizing Deep Learning for Medical Image Segmentation to Unseen Domains via Deep Stacked Transformation.通过深度堆叠变换将深度学习用于医学图像分割推广到未见领域。
IEEE Trans Med Imaging. 2020 Jul;39(7):2531-2540. doi: 10.1109/TMI.2020.2973595. Epub 2020 Feb 12.
9
A comparative analysis of different augmentations for brain images.不同脑图像增强方法的比较分析。
Med Biol Eng Comput. 2024 Oct;62(10):3123-3150. doi: 10.1007/s11517-024-03127-7. Epub 2024 May 24.
10
SalfMix: A Novel Single Image-Based Data Augmentation Technique Using a Saliency Map.SalfMix:一种基于显著图的新型单图像数据增强技术。
Sensors (Basel). 2021 Dec 17;21(24):8444. doi: 10.3390/s21248444.

引用本文的文献

1
A dual GAN with identity blocks and pancreas-inspired loss for renewable energy optimization.一种具有身份块和受胰腺启发的损失函数的对偶生成对抗网络,用于可再生能源优化。
Sci Rep. 2025 May 13;15(1):16635. doi: 10.1038/s41598-025-00600-7.
2
Feature Feedback-Based Pseudo-Label Learning for Multi-Standards in Clinical Acne Grading.基于特征反馈的临床痤疮分级多标准伪标签学习
Bioengineering (Basel). 2025 Mar 26;12(4):342. doi: 10.3390/bioengineering12040342.
3
[Identification of osteoid and chondroid matrix mineralization in primary bone tumors using a deep learning fusion model based on CT and clinical features: a multi-center retrospective study].
[基于CT和临床特征的深度学习融合模型在原发性骨肿瘤中类骨质和类软骨基质矿化的识别:一项多中心回顾性研究]
Nan Fang Yi Ke Da Xue Xue Bao. 2024 Dec 20;44(12):2412-2420. doi: 10.12122/j.issn.1673-4254.2024.12.18.
4
A Machine Learning Model for the Prediction of COVID-19 Severity Using RNA-Seq, Clinical, and Co-Morbidity Data.一种使用RNA测序、临床和合并症数据预测COVID-19严重程度的机器学习模型。
Diagnostics (Basel). 2024 Jun 18;14(12):1284. doi: 10.3390/diagnostics14121284.
5
Prediction model for spinal cord injury in spinal tuberculosis patients using multiple machine learning algorithms: a multicentric study.基于多机器学习算法的脊柱结核患者脊髓损伤预测模型:一项多中心研究。
Sci Rep. 2024 Apr 2;14(1):7691. doi: 10.1038/s41598-024-56711-0.