• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

分类与分割中的数据增强:综述与新策略

Data Augmentation in Classification and Segmentation: A Survey and New Strategies.

作者信息

Alomar Khaled, Aysel Halil Ibrahim, Cai Xiaohao

机构信息

School of Electronics and Computer Science, University of Southampton, Southampton SO17 1BJ, UK.

出版信息

J Imaging. 2023 Feb 17;9(2):46. doi: 10.3390/jimaging9020046.

DOI:10.3390/jimaging9020046
PMID:36826965
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9966095/
Abstract

In the past decade, deep neural networks, particularly convolutional neural networks, have revolutionised computer vision. However, all deep learning models may require a large amount of data so as to achieve satisfying results. Unfortunately, the availability of sufficient amounts of data for real-world problems is not always possible, and it is well recognised that a paucity of data easily results in overfitting. This issue may be addressed through several approaches, one of which is data augmentation. In this paper, we survey the existing data augmentation techniques in computer vision tasks, including segmentation and classification, and suggest new strategies. In particular, we introduce a way of implementing data augmentation by using local information in images. We propose a parameter-free and easy to implement strategy, the random local rotation strategy, which involves randomly selecting the location and size of circular regions in the image and rotating them with random angles. It can be used as an alternative to the traditional rotation strategy, which generally suffers from irregular image boundaries. It can also complement other techniques in data augmentation. Extensive experimental results and comparisons demonstrated that the new strategy consistently outperformed its traditional counterparts in, for example, image classification.

摘要

在过去十年中,深度神经网络,尤其是卷积神经网络,彻底改变了计算机视觉。然而,所有深度学习模型可能都需要大量数据才能取得令人满意的结果。不幸的是,对于现实世界的问题而言,并不总是能够获得足够的数据量,而且人们普遍认识到数据匮乏很容易导致过拟合。这个问题可以通过几种方法来解决,其中之一就是数据增强。在本文中,我们调研了计算机视觉任务(包括分割和分类)中现有的数据增强技术,并提出了新的策略。特别是,我们介绍了一种利用图像中的局部信息来实现数据增强的方法。我们提出了一种无参数且易于实现的策略——随机局部旋转策略,该策略涉及在图像中随机选择圆形区域的位置和大小,并以随机角度旋转它们。它可以作为传统旋转策略的替代方法,传统旋转策略通常存在图像边界不规则的问题。它还可以补充数据增强中的其他技术。大量的实验结果和比较表明,新策略在例如图像分类等方面始终优于传统方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0aa/9966095/856f635c9fe2/jimaging-09-00046-g019.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0aa/9966095/d8d9630d3865/jimaging-09-00046-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0aa/9966095/bab50184bda2/jimaging-09-00046-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0aa/9966095/284ad6bba7ff/jimaging-09-00046-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0aa/9966095/6b7e1e116c68/jimaging-09-00046-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0aa/9966095/94efabbbfb30/jimaging-09-00046-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0aa/9966095/f50d57d857bf/jimaging-09-00046-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0aa/9966095/83f26a527a94/jimaging-09-00046-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0aa/9966095/361127a129a5/jimaging-09-00046-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0aa/9966095/263d1c7ea540/jimaging-09-00046-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0aa/9966095/9eeaae2f7235/jimaging-09-00046-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0aa/9966095/6c6336c95441/jimaging-09-00046-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0aa/9966095/eb174ef140eb/jimaging-09-00046-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0aa/9966095/29813710055e/jimaging-09-00046-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0aa/9966095/ba66f5e1f025/jimaging-09-00046-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0aa/9966095/ffee2712aacc/jimaging-09-00046-g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0aa/9966095/7d041d0f94db/jimaging-09-00046-g016.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0aa/9966095/1dc19601f920/jimaging-09-00046-g017.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0aa/9966095/2181c831c090/jimaging-09-00046-g018.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0aa/9966095/856f635c9fe2/jimaging-09-00046-g019.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0aa/9966095/d8d9630d3865/jimaging-09-00046-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0aa/9966095/bab50184bda2/jimaging-09-00046-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0aa/9966095/284ad6bba7ff/jimaging-09-00046-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0aa/9966095/6b7e1e116c68/jimaging-09-00046-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0aa/9966095/94efabbbfb30/jimaging-09-00046-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0aa/9966095/f50d57d857bf/jimaging-09-00046-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0aa/9966095/83f26a527a94/jimaging-09-00046-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0aa/9966095/361127a129a5/jimaging-09-00046-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0aa/9966095/263d1c7ea540/jimaging-09-00046-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0aa/9966095/9eeaae2f7235/jimaging-09-00046-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0aa/9966095/6c6336c95441/jimaging-09-00046-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0aa/9966095/eb174ef140eb/jimaging-09-00046-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0aa/9966095/29813710055e/jimaging-09-00046-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0aa/9966095/ba66f5e1f025/jimaging-09-00046-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0aa/9966095/ffee2712aacc/jimaging-09-00046-g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0aa/9966095/7d041d0f94db/jimaging-09-00046-g016.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0aa/9966095/1dc19601f920/jimaging-09-00046-g017.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0aa/9966095/2181c831c090/jimaging-09-00046-g018.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0aa/9966095/856f635c9fe2/jimaging-09-00046-g019.jpg

相似文献

1
Data Augmentation in Classification and Segmentation: A Survey and New Strategies.分类与分割中的数据增强:综述与新策略
J Imaging. 2023 Feb 17;9(2):46. doi: 10.3390/jimaging9020046.
2
MeshCut data augmentation for deep learning in computer vision.基于网格切割的数据增强在计算机视觉中的深度学习应用。
PLoS One. 2020 Dec 23;15(12):e0243613. doi: 10.1371/journal.pone.0243613. eCollection 2020.
3
A comprehensive survey of recent trends in deep learning for digital images augmentation.对数字图像增强的深度学习近期趋势的全面调查。
Artif Intell Rev. 2022;55(3):2351-2377. doi: 10.1007/s10462-021-10066-4. Epub 2021 Sep 4.
4
Fully Convolutional Network for the Semantic Segmentation of Medical Images: A Survey.用于医学图像语义分割的全卷积网络综述
Diagnostics (Basel). 2022 Nov 11;12(11):2765. doi: 10.3390/diagnostics12112765.
5
CLoDSA: a tool for augmentation in classification, localization, detection, semantic segmentation and instance segmentation tasks.CLoDSA:用于分类、定位、检测、语义分割和实例分割任务增强的工具。
BMC Bioinformatics. 2019 Jun 13;20(1):323. doi: 10.1186/s12859-019-2931-1.
6
Prostate segmentation in MRI using a convolutional neural network architecture and training strategy based on statistical shape models.基于统计形状模型的卷积神经网络架构和训练策略的 MRI 前列腺分割。
Int J Comput Assist Radiol Surg. 2018 Aug;13(8):1211-1219. doi: 10.1007/s11548-018-1785-8. Epub 2018 May 15.
7
Deep learning for colon cancer histopathological images analysis.用于结肠癌组织病理学图像分析的深度学习
Comput Biol Med. 2021 Sep;136:104730. doi: 10.1016/j.compbiomed.2021.104730. Epub 2021 Aug 4.
8
A novel end-to-end classifier using domain transferred deep convolutional neural networks for biomedical images.一种使用域转移深度卷积神经网络的新型端到端生物医学图像分类器。
Comput Methods Programs Biomed. 2017 Mar;140:283-293. doi: 10.1016/j.cmpb.2016.12.019. Epub 2017 Jan 6.
9
Image data augmentation techniques based on deep learning: A survey.基于深度学习的图像数据增强技术:一项综述。
Math Biosci Eng. 2024 Jun 12;21(6):6190-6224. doi: 10.3934/mbe.2024272.
10
HistoClean: Open-source software for histological image pre-processing and augmentation to improve development of robust convolutional neural networks.HistoClean:用于组织学图像预处理和增强的开源软件,以促进强大卷积神经网络的发展。
Comput Struct Biotechnol J. 2021 Aug 26;19:4840-4853. doi: 10.1016/j.csbj.2021.08.033. eCollection 2021.

引用本文的文献

1
A Novel Hybrid Technique for Detecting and Classifying Hyperspectral Images of Tomato Fungal Diseases Based on Deep Feature Extraction and Manhattan Distance.一种基于深度特征提取和曼哈顿距离的番茄真菌病害高光谱图像检测与分类新混合技术。
Sensors (Basel). 2025 Jul 9;25(14):4285. doi: 10.3390/s25144285.
2
Enhancing Multi-User Activity Recognition in an Indoor Environment with Augmented Wi-Fi Channel State Information and Transformer Architectures.利用增强的Wi-Fi信道状态信息和Transformer架构增强室内环境中的多用户活动识别
Sensors (Basel). 2025 Jun 25;25(13):3955. doi: 10.3390/s25133955.
3
Deep Learning-Based Detection of Honey Storage Areas in Colonies for Predicting Physical Parameters of Honey via Linear Regression.

本文引用的文献

1
Comparison of Different Image Data Augmentation Approaches.不同图像数据增强方法的比较
J Imaging. 2021 Nov 27;7(12):254. doi: 10.3390/jimaging7120254.
2
A comprehensive survey of recent trends in deep learning for digital images augmentation.对数字图像增强的深度学习近期趋势的全面调查。
Artif Intell Rev. 2022;55(3):2351-2377. doi: 10.1007/s10462-021-10066-4. Epub 2021 Sep 4.
3
Improving convolutional neural networks performance for image classification using test time augmentation: a case study using MURA dataset.
基于深度学习的蜂群蜂蜜储存区域检测,通过线性回归预测蜂蜜物理参数
Insects. 2025 May 29;16(6):575. doi: 10.3390/insects16060575.
4
Improving healthcare sustainability using advanced brain simulations using a multi-modal deep learning strategy with VGG19 and bidirectional LSTM.使用具有VGG19和双向长短期记忆网络的多模态深度学习策略,通过先进的脑模拟来提高医疗保健的可持续性。
Front Med (Lausanne). 2025 Apr 10;12:1574428. doi: 10.3389/fmed.2025.1574428. eCollection 2025.
5
Advances in Deep Learning for Semantic Segmentation of Low-Contrast Images: A Systematic Review of Methods, Challenges, and Future Directions.低对比度图像语义分割的深度学习进展:方法、挑战及未来方向的系统综述
Sensors (Basel). 2025 Mar 25;25(7):2043. doi: 10.3390/s25072043.
6
EMI-LTI: An enhanced integrated model for lung tumor identification using Gabor filter and ROI.EMI-LTI:一种使用Gabor滤波器和感兴趣区域进行肺肿瘤识别的增强集成模型。
MethodsX. 2025 Feb 27;14:103247. doi: 10.1016/j.mex.2025.103247. eCollection 2025 Jun.
7
Data augmented lung cancer prediction framework using the nested case control NLST cohort.使用巢式病例对照NLST队列的数据增强肺癌预测框架。
Front Oncol. 2025 Feb 25;15:1492758. doi: 10.3389/fonc.2025.1492758. eCollection 2025.
8
Swin-Unet++: a study on phenotypic parameter analysis of cabbage seedling roots.Swin-Unet++:甘蓝幼苗根系表型参数分析研究
Plant Methods. 2025 Mar 3;21(1):30. doi: 10.1186/s13007-025-01340-5.
9
PC-CS-YOLO: High-Precision Obstacle Detection for Visually Impaired Safety.PC-CS-YOLO:用于视障安全的高精度障碍物检测
Sensors (Basel). 2025 Jan 17;25(2):534. doi: 10.3390/s25020534.
10
An Enhanced Approach Using AGS Network for Skin Cancer Classification.一种使用AGS网络进行皮肤癌分类的增强方法。
Sensors (Basel). 2025 Jan 10;25(2):394. doi: 10.3390/s25020394.
使用测试时增强提高卷积神经网络在图像分类任务中的性能:以MURA数据集为例的研究
Health Inf Sci Syst. 2021 Jul 31;9(1):33. doi: 10.1007/s13755-021-00163-7. eCollection 2021 Dec.
4
Text Data Augmentation for Deep Learning.用于深度学习的文本数据增强
J Big Data. 2021;8(1):101. doi: 10.1186/s40537-021-00492-0. Epub 2021 Jul 19.
5
A review of medical image data augmentation techniques for deep learning applications.医学图像数据增强技术在深度学习应用中的综述。
J Med Imaging Radiat Oncol. 2021 Aug;65(5):545-563. doi: 10.1111/1754-9485.13261. Epub 2021 Jun 19.
6
An enhanced deep learning approach for brain cancer MRI images classification using residual networks.基于残差网络的脑癌 MRI 图像分类增强型深度学习方法。
Artif Intell Med. 2020 Jan;102:101779. doi: 10.1016/j.artmed.2019.101779. Epub 2019 Dec 10.
7
Using Synthetic Training Data for Deep Learning-Based GBM Segmentation.使用合成训练数据进行基于深度学习的胶质母细胞瘤分割
Annu Int Conf IEEE Eng Med Biol Soc. 2019 Jul;2019:6724-6729. doi: 10.1109/EMBC.2019.8856297.
8
CLoDSA: a tool for augmentation in classification, localization, detection, semantic segmentation and instance segmentation tasks.CLoDSA:用于分类、定位、检测、语义分割和实例分割任务增强的工具。
BMC Bioinformatics. 2019 Jun 13;20(1):323. doi: 10.1186/s12859-019-2931-1.
9
CNN-Based Real-Time Dense Face Reconstruction with Inverse-Rendered Photo-Realistic Face Images.基于卷积神经网络的实时密集人脸重建与逆渲染逼真人脸图像
IEEE Trans Pattern Anal Mach Intell. 2019 Jun;41(6):1294-1307. doi: 10.1109/TPAMI.2018.2837742. Epub 2018 May 17.
10
Differential Data Augmentation Techniques for Medical Imaging Classification Tasks.用于医学影像分类任务的差分数据增强技术
AMIA Annu Symp Proc. 2018 Apr 16;2017:979-984. eCollection 2017.