• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过语义数据增强对深度网络进行正则化

Regularizing Deep Networks With Semantic Data Augmentation.

作者信息

Wang Yulin, Huang Gao, Song Shiji, Pan Xuran, Xia Yitong, Wu Cheng

出版信息

IEEE Trans Pattern Anal Mach Intell. 2022 Jul;44(7):3733-3748. doi: 10.1109/TPAMI.2021.3052951. Epub 2022 Jun 3.

DOI:10.1109/TPAMI.2021.3052951
PMID:33476265
Abstract

Data augmentation is widely known as a simple yet surprisingly effective technique for regularizing deep networks. Conventional data augmentation schemes, e.g., flipping, translation or rotation, are low-level, data-independent and class-agnostic operations, leading to limited diversity for augmented samples. To this end, we propose a novel semantic data augmentation algorithm to complement traditional approaches. The proposed method is inspired by the intriguing property that deep networks are effective in learning linearized features, i.e., certain directions in the deep feature space correspond to meaningful semantic transformations, e.g., changing the background or view angle of an object. Based on this observation, translating training samples along many such directions in the feature space can effectively augment the dataset for more diversity. To implement this idea, we first introduce a sampling based method to obtain semantically meaningful directions efficiently. Then, an upper bound of the expected cross-entropy (CE) loss on the augmented training set is derived by assuming the number of augmented samples goes to infinity, yielding a highly efficient algorithm. In fact, we show that the proposed implicit semantic data augmentation (ISDA) algorithm amounts to minimizing a novel robust CE loss, which adds minimal extra computational cost to a normal training procedure. In addition to supervised learning, ISDA can be applied to semi-supervised learning tasks under the consistency regularization framework, where ISDA amounts to minimizing the upper bound of the expected KL-divergence between the augmented features and the original features. Although being simple, ISDA consistently improves the generalization performance of popular deep models (e.g., ResNets and DenseNets) on a variety of datasets, i.e., CIFAR-10, CIFAR-100, SVHN, ImageNet, and Cityscapes. Code for reproducing our results is available at https://github.com/blackfeather-wang/ISDA-for-Deep-Networks.

摘要

数据增强作为一种用于正则化深度网络的简单却惊人有效的技术广为人知。传统的数据增强方案,例如翻转、平移或旋转,都是低级的、与数据无关且不区分类别的操作,导致增强样本的多样性有限。为此,我们提出了一种新颖的语义数据增强算法来补充传统方法。所提出的方法受到这样一个有趣特性的启发:深度网络在学习线性化特征方面很有效,即深度特征空间中的某些方向对应于有意义的语义变换,例如改变物体的背景或视角。基于这一观察,在特征空间中沿着许多这样的方向平移训练样本可以有效地扩充数据集以获得更多样性。为了实现这个想法,我们首先引入一种基于采样的方法来高效地获得语义上有意义的方向。然后,通过假设增强样本的数量趋于无穷大,推导出增强训练集上期望交叉熵(CE)损失的上界,从而得到一种高效的算法。事实上,我们表明所提出的隐式语义数据增强(ISDA)算法相当于最小化一种新颖的鲁棒CE损失,这在正常训练过程中只增加了极少的额外计算成本。除了监督学习,ISDA可以应用于一致性正则化框架下的半监督学习任务,在这种情况下,ISDA相当于最小化增强特征与原始特征之间期望KL散度的上界。尽管ISDA很简单,但它在各种数据集(即CIFAR - 10、CIFAR - 100、SVHN、ImageNet和Cityscapes)上持续提高了流行深度模型(例如ResNets和DenseNets)的泛化性能。用于重现我们结果的代码可在https://github.com/blackfeather - wang/ISDA - for - Deep - Networks获取。

相似文献

1
Regularizing Deep Networks With Semantic Data Augmentation.通过语义数据增强对深度网络进行正则化
IEEE Trans Pattern Anal Mach Intell. 2022 Jul;44(7):3733-3748. doi: 10.1109/TPAMI.2021.3052951. Epub 2022 Jun 3.
2
FMixCutMatch for semi-supervised deep learning.FMixCutMatch 用于半监督深度学习。
Neural Netw. 2021 Jan;133:166-176. doi: 10.1016/j.neunet.2020.10.018. Epub 2020 Nov 10.
3
Fine-Grained Recognition With Learnable Semantic Data Augmentation.基于可学习语义数据增强的细粒度识别
IEEE Trans Image Process. 2024;33:3130-3144. doi: 10.1109/TIP.2024.3364500. Epub 2024 Apr 30.
4
Brain-inspired semantic data augmentation for multi-style images.用于多风格图像的脑启发式语义数据增强
Front Neurorobot. 2024 Mar 26;18:1382406. doi: 10.3389/fnbot.2024.1382406. eCollection 2024.
5
Smooth-Guided Implicit Data Augmentation for Domain Generalization.用于领域泛化的平滑引导隐式数据增强
IEEE Trans Neural Netw Learn Syst. 2025 Mar;36(3):4984-4995. doi: 10.1109/TNNLS.2024.3377439. Epub 2025 Feb 28.
6
Co-Training for Visual Object Recognition Based on Self-Supervised Models Using a Cross-Entropy Regularization.基于使用交叉熵正则化的自监督模型的视觉目标识别协同训练
Entropy (Basel). 2021 Apr 1;23(4):423. doi: 10.3390/e23040423.
7
Interpolation-Based Contrastive Learning for Few-Label Semi-Supervised Learning.基于插值的少标签半监督学习对比学习
IEEE Trans Neural Netw Learn Syst. 2024 Feb;35(2):2054-2065. doi: 10.1109/TNNLS.2022.3186512. Epub 2024 Feb 5.
8
Self-Adaptive Training: Bridging Supervised and Self-Supervised Learning.自适应训练:连接监督学习与自监督学习
IEEE Trans Pattern Anal Mach Intell. 2024 Mar;46(3):1362-1377. doi: 10.1109/TPAMI.2022.3217792. Epub 2024 Feb 6.
9
MutexMatch: Semi-Supervised Learning With Mutex-Based Consistency Regularization.互斥匹配:基于互斥一致性正则化的半监督学习
IEEE Trans Neural Netw Learn Syst. 2024 Jun;35(6):8441-8455. doi: 10.1109/TNNLS.2022.3228380. Epub 2024 Jun 3.
10
EnAET: A Self-Trained Framework for Semi-Supervised and Supervised Learning With Ensemble Transformations.EnAET:一种用于半监督和监督学习的集成变换自训练框架。
IEEE Trans Image Process. 2021;30:1639-1647. doi: 10.1109/TIP.2020.3044220. Epub 2021 Jan 11.

引用本文的文献

1
Dynamic key vascular anatomy dataset for D2 lymph node dissection during laparoscopic gastric cancer surgery.腹腔镜胃癌手术中D2淋巴结清扫的动态关键血管解剖数据集
Sci Data. 2025 May 29;12(1):903. doi: 10.1038/s41597-025-05255-7.
2
BSDA: Bayesian Random Semantic Data Augmentation for Medical Image Classification.BSDA:用于医学图像分类的贝叶斯随机语义数据增强
Sensors (Basel). 2024 Nov 25;24(23):7511. doi: 10.3390/s24237511.
3
Data augmentation via warping transforms for modeling natural variability in the corneal endothelium enhances semi-supervised segmentation.
通过变形变换进行数据增强,以模拟角膜内皮的自然变异性,从而增强半监督分割。
PLoS One. 2024 Nov 12;19(11):e0311849. doi: 10.1371/journal.pone.0311849. eCollection 2024.
4
Non-invasive diagnosis of pancreatic steatosis with ultrasound images using deep learning network.使用深度学习网络通过超声图像对胰腺脂肪变性进行无创诊断。
Heliyon. 2024 Sep 6;10(17):e37580. doi: 10.1016/j.heliyon.2024.e37580. eCollection 2024 Sep 15.
5
Brain-inspired semantic data augmentation for multi-style images.用于多风格图像的脑启发式语义数据增强
Front Neurorobot. 2024 Mar 26;18:1382406. doi: 10.3389/fnbot.2024.1382406. eCollection 2024.
6
Adversarial counterfactual augmentation: application in Alzheimer's disease classification.对抗性反事实增强:在阿尔茨海默病分类中的应用
Front Radiol. 2022 Nov 30;2:1039160. doi: 10.3389/fradi.2022.1039160. eCollection 2022.
7
A Double-Teacher Model Capable of Exploiting Isomorphic and Heterogeneous Discrepancy Information for Medical Image Segmentation.一种能够利用同构和异构差异信息进行医学图像分割的双教师模型。
Diagnostics (Basel). 2023 Jun 5;13(11):1971. doi: 10.3390/diagnostics13111971.
8
Semi-Supervised Medical Image Segmentation Guided by Bi-Directional Constrained Dual-Task Consistency.基于双向约束双任务一致性引导的半监督医学图像分割
Bioengineering (Basel). 2023 Feb 7;10(2):225. doi: 10.3390/bioengineering10020225.
9
Ultrasound image-based deep learning to differentiate tubal-ovarian abscess from ovarian endometriosis cyst.基于超声图像的深度学习用于鉴别输卵管卵巢脓肿与卵巢子宫内膜异位囊肿。
Front Physiol. 2023 Feb 7;14:1101810. doi: 10.3389/fphys.2023.1101810. eCollection 2023.
10
A Review of Performance Prediction Based on Machine Learning in Materials Science.基于机器学习的材料科学性能预测综述
Nanomaterials (Basel). 2022 Aug 26;12(17):2957. doi: 10.3390/nano12172957.