FMixCutMatch for semi-supervised deep learning.

Affiliation

School of Software Engineering, Beijing Jiaotong University, Beijing, China.

Publication information

Neural Netw. 2021 Jan;133:166-176. doi: 10.1016/j.neunet.2020.10.018. Epub 2020 Nov 10.

DOI:10.1016/j.neunet.2020.10.018

Abstract

Mixed sample augmentation (MSA), which mixes two training samples to smooth the training space, has been highly successful in semi-supervised learning (SSL). Building on insights into the efficacy of cut-mix in particular, we propose FMixCut, an MSA that combines Fourier space-based data mixing (FMix) with the proposed Fourier space-based data cutting (FCut) for labeled and unlabeled data augmentation. Specifically, for the SSL task, our approach first generates soft pseudo-labels from the model's previous predictions. The model is then trained so that its outputs on the FMix-generated samples are consistent with their mixed soft pseudo-labels. In addition, we propose FCut, a new Cutout-based data augmentation strategy that reuses the two masked sample pairs from FMix for weighted cross-entropy minimization. Two regularization techniques, batch label distribution entropy maximization and sample confidence entropy minimization, further boost training efficiency. Finally, we introduce a dynamic labeled-unlabeled data mixing (DDM) strategy to accelerate the convergence of the model. We call the combined SSL approach "FMixCutMatch", or FMCmatch for short. FMCmatch achieves state-of-the-art performance on CIFAR-10/100, SVHN and Mini-Imagenet across a variety of SSL conditions with the CNN-13, WRN-28-2 and ResNet-18 networks. In particular, our method achieves a 4.54% test error on CIFAR-10 with 4K labels under the CNN-13 and a 41.25% Top-1 test error on Mini-Imagenet with 10K labels under the ResNet-18. Our code for reproducing these results is publicly available at https://github.com/biuyq/FMixCutMatch.
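The steps named in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation (see the repository linked above for that). It assumes the usual FMix recipe of low-pass filtering random Fourier noise and binarizing it so that roughly a fraction `lam` of pixels is kept. The function names, the `decay` default, and the choice of FCut weights (mask area) are hypothetical choices made for illustration only.

```python
import numpy as np

def sample_fmix_mask(height, width, lam, decay=3.0, rng=None):
    """Binary (H, W) mask whose fraction of ones is round(lam*H*W)/(H*W)."""
    rng = np.random.default_rng() if rng is None else rng
    # Frequency magnitude of every 2-D Fourier coefficient.
    fy = np.fft.fftfreq(height)[:, None]
    fx = np.fft.fftfreq(width)[None, :]
    freq = np.sqrt(fx ** 2 + fy ** 2)
    freq[0, 0] = 1.0 / max(height, width)  # avoid dividing by zero at DC
    # Complex Gaussian noise, attenuated at high frequencies (low-pass).
    noise = rng.standard_normal((height, width)) \
        + 1j * rng.standard_normal((height, width))
    grey = np.real(np.fft.ifft2(noise / freq ** decay))
    # Binarize: the k largest grey values become 1, the rest 0.
    k = int(round(lam * height * width))
    mask = np.zeros(height * width, dtype=np.float32)
    if k > 0:
        mask[np.argsort(grey.ravel())[::-1][:k]] = 1.0
    return mask.reshape(height, width)

def fmix_pair(x1, x2, p1, p2, mask):
    """Mix two (C, H, W) images and their (K,) soft pseudo-labels."""
    lam_eff = float(mask.mean())             # fraction of pixels from x1
    x_mix = mask * x1 + (1.0 - mask) * x2    # mask broadcasts over channels
    p_mix = lam_eff * p1 + (1.0 - lam_eff) * p2
    return x_mix, p_mix

def fcut_pair(x1, x2, mask):
    """Two Cutout-style samples that reuse the FMix mask.

    ASSUMPTION: weighting each sample's cross-entropy by its visible
    area is an illustrative choice, not taken from the paper.
    """
    w1 = float(mask.mean())
    return mask * x1, (1.0 - mask) * x2, w1, 1.0 - w1

def entropy_regularizers(probs, eps=1e-8):
    """probs: (N, K) softmax outputs. Both returned terms are minimized."""
    mean_p = probs.mean(axis=0)
    # Minimizing this maximizes the entropy of the batch-mean prediction.
    neg_batch_entropy = float(np.sum(mean_p * np.log(mean_p + eps)))
    # Average per-sample entropy; minimizing it sharpens predictions.
    sample_entropy = float(-np.mean(np.sum(probs * np.log(probs + eps), axis=1)))
    return neg_batch_entropy, sample_entropy
```

In a training loop under these assumptions, `p_mix` would serve as the consistency target for the model's output on `x_mix`, and the two regularizer terms would be added to the loss with small weights.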

Similar articles

FMixCutMatch for semi-supervised deep learning.

Neural Netw. 2021 Jan;133:166-176. doi: 10.1016/j.neunet.2020.10.018. Epub 2020 Nov 10.

CPSS: Fusing consistency regularization and pseudo-labeling techniques for semi-supervised deep cardiovascular disease detection using all unlabeled electrocardiograms.

Comput Methods Programs Biomed. 2024 Sep;254:108315. doi: 10.1016/j.cmpb.2024.108315. Epub 2024 Jul 4.

Boosting semi-supervised learning with Contrastive Complementary Labeling.

Neural Netw. 2024 Feb;170:417-426. doi: 10.1016/j.neunet.2023.11.052. Epub 2023 Nov 27.

FaxMatch: Multi-Curriculum Pseudo-Labeling for semi-supervised medical image classification.

Med Phys. 2023 May;50(5):3210-3222. doi: 10.1002/mp.16312. Epub 2023 Feb 21.

A distributed semi-supervised learning algorithm based on manifold regularization using wavelet neural network.

Neural Netw. 2019 Oct;118:300-309. doi: 10.1016/j.neunet.2018.10.014. Epub 2018 Nov 14.

Mutual consistency learning for semi-supervised medical image segmentation.

Med Image Anal. 2022 Oct;81:102530. doi: 10.1016/j.media.2022.102530. Epub 2022 Jul 6.

End-to-end novel visual categories learning via auxiliary self-supervision.

Neural Netw. 2021 Jul;139:24-32. doi: 10.1016/j.neunet.2021.02.015. Epub 2021 Feb 23.

Graph-Based Self-Training for Semi-Supervised Deep Similarity Learning.

Sensors (Basel). 2023 Apr 13;23(8):3944. doi: 10.3390/s23083944.

Handling Imbalanced Data: Uncertainty-Guided Virtual Adversarial Training With Batch Nuclear-Norm Optimization for Semi-Supervised Medical Image Classification.

IEEE J Biomed Health Inform. 2022 Jul;26(7):2983-2994. doi: 10.1109/JBHI.2022.3162748. Epub 2022 Jul 1.

MutexMatch: Semi-Supervised Learning With Mutex-Based Consistency Regularization.

IEEE Trans Neural Netw Learn Syst. 2024 Jun;35(6):8441-8455. doi: 10.1109/TNNLS.2022.3228380. Epub 2024 Jun 3.

Cited by

A new method of semi-supervised learning classification based on multi-mode augmentation in small labeled sample environment.

Sci Rep. 2025 Jul 1;15(1):22022. doi: 10.1038/s41598-025-02324-0.

Applications of deep learning for phishing detection: a systematic literature review.

Knowl Inf Syst. 2022;64(6):1457-1500. doi: 10.1007/s10115-022-01672-x. Epub 2022 May 23.
