Qian Dong, Cheung William K
IEEE Trans Pattern Anal Mach Intell. 2023 Feb;45(2):1949-1962. doi: 10.1109/TPAMI.2022.3160509. Epub 2023 Jan 6.
Variational autoencoders (VAEs) are a class of effective deep generative models whose objective is to approximate the true but unknown data distribution. VAEs rely on latent variables to capture high-level semantics, so that informative latent representations help reconstruct the data well. Yet, training VAEs tends to suffer from posterior collapse when the decoder is parameterized by an autoregressive model for sequence generation. VAEs can be further enhanced by introducing multiple layers of latent variables, but posterior collapse hinders the adoption of such hierarchical VAEs in real-world applications. In this paper, we introduce InfoMaxHVAE, which integrates neural estimates of mutual information into hierarchical VAEs to alleviate posterior collapse when powerful autoregressive models are used to model sequences. Experimental results on several text and image datasets show that InfoMaxHVAE outperforms state-of-the-art baselines and exhibits less posterior collapse. We further show that InfoMaxHVAE shapes a coarse-to-fine hierarchical organization of the latent space.
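The abstract does not state the training objective, so the following is a minimal illustrative sketch rather than the paper's actual formulation; the weight λ and the estimator parameters ψ are assumptions introduced here for illustration. For a single-layer VAE, the standard evidence lower bound is

\mathcal{L}_{\mathrm{ELBO}}(\theta,\phi) = \mathbb{E}_{q_\phi(z \mid x)}\left[\log p_\theta(x \mid z)\right] - D_{\mathrm{KL}}\left(q_\phi(z \mid x)\,\|\,p(z)\right),

and posterior collapse corresponds to q_\phi(z \mid x) \approx p(z) for all x, i.e. the mutual information I_q(x;z) \approx 0, so an expressive autoregressive decoder can ignore z. A mutual-information-regularized objective of the general kind described in the abstract might then take the form

\mathcal{L}_{\mathrm{InfoMax}}(\theta,\phi,\psi) = \mathcal{L}_{\mathrm{ELBO}}(\theta,\phi) + \lambda\,\hat{I}_\psi(x;z),

where \hat{I}_\psi(x;z) is a mutual information estimate parameterized by a neural network (for example, a MINE- or InfoNCE-style lower bound), and in the hierarchical case one such term could be added per latent layer z_1, \dots, z_L.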