Max-Margin Deep Generative Models for (Semi-)Supervised Learning.

Author information

Li Chongxuan, Zhu Jun, Zhang Bo

Publication information

IEEE Trans Pattern Anal Mach Intell. 2018 Nov;40(11):2762-2775. doi: 10.1109/TPAMI.2017.2766142. Epub 2017 Oct 24.

DOI:10.1109/TPAMI.2017.2766142

Abstract

Deep generative models (DGMs) can effectively capture the underlying distributions of complex data by learning multilayered representations and performing inference. However, relatively little has been done to boost the discriminative ability of DGMs. This paper presents max-margin deep generative models (mmDGMs) and a class-conditional variant (mmDCGMs), which explore the strongly discriminative principle of max-margin learning to improve the predictive performance of DGMs in both supervised and semi-supervised learning, while retaining the generative capability. In semi-supervised learning, for efficiency, we use the predictions of a max-margin classifier as the missing labels instead of performing full posterior inference; for effectiveness, we also introduce additional max-margin and label-balance regularization terms on the unlabeled data. We develop an efficient doubly stochastic subgradient algorithm for the piecewise linear objectives in different settings. Empirical results on various datasets demonstrate that: (1) max-margin learning can significantly improve the prediction performance of DGMs while retaining the generative ability; (2) in supervised learning, mmDGMs are competitive with the best fully discriminative networks when employing convolutional neural networks as the generative and recognition models; and (3) in semi-supervised learning, mmDCGMs can perform efficient inference and achieve state-of-the-art classification results on several benchmarks.
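To make the phrase "doubly stochastic subgradient algorithm for the piecewise linear objectives" more concrete, here is a minimal sketch in plain Python. It is not the authors' implementation. It assumes a linear max-margin classifier on top of latent features, a Crammer-Singer style multiclass hinge loss, and a diagonal Gaussian over the latent features. The names `multiclass_hinge` and `doubly_stochastic_step` and the `(mean, std, label)` data layout are hypothetical. The two sources of randomness are the random minibatch and the single Monte Carlo draw of the latent features.

```python
import random


def multiclass_hinge(weights, features, label, margin=1.0):
    """Return (loss, subgradient) of a multiclass hinge loss.

    weights: one weight vector per class (list of lists of floats).
    The loss is piecewise linear in the weights, so a subgradient is used.
    """
    scores = [sum(w * f for w, f in zip(row, features)) for row in weights]
    # Loss-augmented scores: every wrong class gets the margin added.
    augmented = [s + (0.0 if k == label else margin) for k, s in enumerate(scores)]
    worst = max(range(len(scores)), key=lambda k: augmented[k])
    loss = augmented[worst] - scores[label]
    grad = [[0.0] * len(features) for _ in weights]
    if worst != label:
        # Push the violating class down and the true class up.
        for j, f in enumerate(features):
            grad[worst][j] += f
            grad[label][j] -= f
    return loss, grad


def doubly_stochastic_step(weights, data, batch_size, lr, rng):
    """One update; data holds hypothetical (mean, std, label) triples."""
    batch = rng.sample(data, batch_size)  # randomness 1: minibatch of data
    total_loss = 0.0
    total_grad = [[0.0] * len(row) for row in weights]
    for mean, std, label in batch:
        # Randomness 2: one Gaussian draw of the latent features (assumed form).
        z = [m + s * rng.gauss(0.0, 1.0) for m, s in zip(mean, std)]
        loss, grad = multiclass_hinge(weights, z, label)
        total_loss += loss
        for k, row in enumerate(grad):
            for j, g in enumerate(row):
                total_grad[k][j] += g
    # Apply the averaged subgradient in place.
    for k, row in enumerate(total_grad):
        for j, g in enumerate(row):
            weights[k][j] -= lr * g / batch_size
    return total_loss / batch_size
```

In the semi-supervised setting described in the abstract, one could fill a missing `label` with the classifier's own prediction (the index of the highest score) before calling the step. The label-balance regularizer is not modeled in this sketch.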

Similar articles

Learning Deep Generative Models With Doubly Stochastic Gradient MCMC.

IEEE Trans Neural Netw Learn Syst. 2018 Jul;29(7):3084-3096. doi: 10.1109/TNNLS.2017.2688499. Epub 2017 Jun 28.

Max-Margin Majority Voting for Learning from Crowds.

IEEE Trans Pattern Anal Mach Intell. 2019 Oct;41(10):2480-2494. doi: 10.1109/TPAMI.2018.2860987. Epub 2018 Jul 31.

Semi Supervised Learning with Deep Embedded Clustering for Image Classification and Segmentation.

IEEE Access. 2019;7:11093-11104. doi: 10.1109/ACCESS.2019.2891970. Epub 2019 Jan 9.

Deep virtual adversarial self-training with consistency regularization for semi-supervised medical image classification.

Med Image Anal. 2021 May;70:102010. doi: 10.1016/j.media.2021.102010. Epub 2021 Feb 22.

Semi-Supervised Deep Learning Using Pseudo Labels for Hyperspectral Image Classification.

IEEE Trans Image Process. 2018 Mar;27(3):1259-1270. doi: 10.1109/TIP.2017.2772836. Epub 2017 Nov 13.

Semisupervised learning for a hybrid generative/discriminative classifier based on the maximum entropy principle.

IEEE Trans Pattern Anal Mach Intell. 2008 Mar;30(3):424-37. doi: 10.1109/TPAMI.2007.70710.

Semi-Supervised Generative Adversarial Nets with Multiple Generators for SAR Image Recognition.

Sensors (Basel). 2018 Aug 17;18(8):2706. doi: 10.3390/s18082706.

Robust Semi-Supervised Traffic Sign Recognition via Self-Training and Weakly-Supervised Learning.

Sensors (Basel). 2020 May 8;20(9):2684. doi: 10.3390/s20092684.

Novel deep generative simultaneous recurrent model for efficient representation learning.

Neural Netw. 2018 Nov;107:12-22. doi: 10.1016/j.neunet.2018.04.020. Epub 2018 Aug 9.

Cited by

Supervising the Decoder of Variational Autoencoders to Improve Scientific Utility.

IEEE Trans Signal Process. 2022;70:5954-5966. doi: 10.1109/tsp.2022.3230329. Epub 2022 Dec 19.

Research on the Guidance of Youth Labor Education Based on the "Combination of Education and Production Labor" Program Based on the Deep Learning Model.

Comput Intell Neurosci. 2022 Oct 11;2022:2576559. doi: 10.1155/2022/2576559. eCollection 2022.

Indirectly-Supervised Anomaly Detection of Clinically-Meaningful Health Events from Smart Home Data.

ACM Trans Intell Syst Technol. 2021 Mar;12(2):1-18. doi: 10.1145/3439870. Epub 2021 Feb 11.
