Wang Yulin, Huang Gao, Song Shiji, Pan Xuran, Xia Yitong, Wu Cheng
IEEE Trans Pattern Anal Mach Intell. 2022 Jul;44(7):3733-3748. doi: 10.1109/TPAMI.2021.3052951. Epub 2022 Jun 3.
Data augmentation is widely known as a simple yet surprisingly effective technique for regularizing deep networks. Conventional data augmentation schemes, e.g., flipping, translation or rotation, are low-level, data-independent and class-agnostic operations, leading to limited diversity for augmented samples. To address this limitation, we propose a novel semantic data augmentation algorithm to complement traditional approaches. The proposed method is inspired by the intriguing property that deep networks are effective in learning linearized features, i.e., certain directions in the deep feature space correspond to meaningful semantic transformations, e.g., changing the background or view angle of an object. Based on this observation, translating training samples along many such directions in the feature space can effectively augment the dataset with greater diversity. To implement this idea, we first introduce a sampling-based method to obtain semantically meaningful directions efficiently. Then, an upper bound of the expected cross-entropy (CE) loss on the augmented training set is derived by letting the number of augmented samples go to infinity, yielding a highly efficient algorithm. In fact, we show that the proposed implicit semantic data augmentation (ISDA) algorithm amounts to minimizing a novel robust CE loss, which adds minimal extra computational cost to a normal training procedure. In addition to supervised learning, ISDA can be applied to semi-supervised learning tasks under the consistency regularization framework, where ISDA amounts to minimizing the upper bound of the expected KL-divergence between the augmented features and the original features. Despite its simplicity, ISDA consistently improves the generalization performance of popular deep models (e.g., ResNets and DenseNets) on a variety of datasets, including CIFAR-10, CIFAR-100, SVHN, ImageNet, and Cityscapes.
Code for reproducing our results is available at https://github.com/blackfeather-wang/ISDA-for-Deep-Networks.
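To make the abstract's central claim concrete: the robust CE loss that implicitly performs the augmentation can be computed by adding, for each sample, a quadratic penalty to every non-target logit based on the class-conditional feature covariance. The NumPy sketch below illustrates this form of the upper bound; all function and variable names here are our own illustrative choices (the paper's notation uses features a, final-layer weights w, biases b, and per-class covariances Σ), and a practical implementation would estimate the covariances online during training.

```python
import numpy as np

def isda_robust_ce(features, weights, biases, labels, class_covs, lam):
    """Sketch of the ISDA robust cross-entropy upper bound.

    features   : (N, D) deep features for a mini-batch
    weights    : (C, D) final-layer weight rows, one per class
    biases     : (C,)   final-layer biases
    labels     : (N,)   integer class labels
    class_covs : (C, D, D) per-class feature covariance estimates
    lam        : augmentation strength lambda (lam = 0 recovers standard CE)
    """
    N, _ = features.shape
    logits = features @ weights.T + biases        # (N, C) standard logits
    losses = np.empty(N)
    for i in range(N):
        y = labels[i]
        delta_w = weights - weights[y]            # (C, D) rows w_j - w_y
        # quadratic penalty (lambda/2) * (w_j - w_y)^T Sigma_y (w_j - w_y);
        # it is zero for the target class j = y
        quad = 0.5 * lam * np.einsum('cd,de,ce->c',
                                     delta_w, class_covs[y], delta_w)
        z = logits[i] + quad                      # augmented logits
        z = z - z.max()                           # numerical stability
        losses[i] = -(z[y] - np.log(np.exp(z).sum()))
    return losses.mean()
```

Because the covariance matrices are positive semi-definite, the penalty is non-negative, so the loss is never smaller than the standard CE loss it upper-bounds; setting `lam = 0` reduces it exactly to ordinary cross-entropy.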