分层和自关注序列自动编码器。

IEEE Trans Pattern Anal Mach Intell. 2022 Sep;44(9):4975-4986. doi: 10.1109/TPAMI.2021.3068187. Epub 2022 Aug 4.

It is important and challenging to infer stochastic latent semantics for natural language applications. The difficulty in stochastic sequential learning is caused by the posterior collapse in variational inference. The input sequence is disregarded in the estimated latent variables. This paper proposes three components to tackle this difficulty and build the variational sequence autoencoder (VSAE) where sufficient latent information is learned for sophisticated sequence representation. First, the complementary encoders based on a long short-term memory (LSTM) and a pyramid bidirectional LSTM are merged to characterize global and structural dependencies of an input sequence, respectively. Second, a stochastic self attention mechanism is incorporated in a recurrent decoder. The latent information is attended to encourage the interaction between inference and generation in an encoder-decoder training procedure. Third, an autoregressive Gaussian prior of latent variable is used to preserve the information bound. Different variants of VSAE are proposed to mitigate the posterior collapse in sequence modeling. A series of experiments are conducted to demonstrate that the proposed individual and hybrid sequence autoencoders substantially improve the performance for variational sequential learning in language modeling and semantic understanding for document classification and summarization.

对于自然语言应用来说，推断随机潜在语义是重要且具有挑战性的。随机序列学习的难点在于变分推断中的后验崩溃。在估计的潜在变量中忽略了输入序列。本文提出了三个组件来解决这个困难，并构建了变分序列自动编码器（VSAE），在该模型中可以学习到足够的潜在信息，从而实现复杂的序列表示。首先，基于长短期记忆（LSTM）和金字塔双向 LSTM 的互补编码器分别用于描述输入序列的全局和结构依赖关系。其次，在递归解码器中加入了随机自注意力机制。潜在信息被关注，以鼓励编码器-解码器训练过程中的推理和生成之间的交互。第三，使用潜在变量的自回归高斯先验来保留信息边界。提出了不同的 VSAE 变体来减轻序列建模中的后验崩溃。进行了一系列实验，以证明所提出的单个和混合序列自动编码器在语言建模中的变分序列学习和语义理解以及文档分类和摘要中的性能有了显著提高。

相似文献

Hierarchical and Self-Attended Sequence Autoencoder.

IEEE Trans Pattern Anal Mach Intell. 2022 Sep;44(9):4975-4986. doi: 10.1109/TPAMI.2021.3068187. Epub 2022 Aug 4.

Learning Hierarchical Variational Autoencoders With Mutual Information Maximization for Autoregressive Sequence Modeling.

IEEE Trans Pattern Anal Mach Intell. 2023 Feb;45(2):1949-1962. doi: 10.1109/TPAMI.2022.3160509. Epub 2023 Jan 6.

Attention Autoencoder for Generative Latent Representational Learning in Anomaly Detection.

Sensors (Basel). 2021 Dec 24;22(1):123. doi: 10.3390/s22010123.

Translating medical image to radiological report: Adaptive multilevel multi-attention approach.

Comput Methods Programs Biomed. 2022 Jun;221:106853. doi: 10.1016/j.cmpb.2022.106853. Epub 2022 May 4.

Improving Chemical Autoencoder Latent Space and Molecular Generation Diversity with Heteroencoders.

Biomolecules. 2018 Oct 30;8(4):131. doi: 10.3390/biom8040131.

DyVGRNN: DYnamic mixture Variational Graph Recurrent Neural Networks.

Neural Netw. 2023 Aug;165:596-610. doi: 10.1016/j.neunet.2023.05.048. Epub 2023 Jun 5.

An informative dual ForkNet for video anomaly detection.

Neural Netw. 2024 Nov;179:106509. doi: 10.1016/j.neunet.2024.106509. Epub 2024 Jul 11.

Deep Latent-Variable Kernel Learning.

IEEE Trans Cybern. 2022 Oct;52(10):10276-10289. doi: 10.1109/TCYB.2021.3062140. Epub 2022 Sep 19.

An LSTM-based adversarial variational autoencoder framework for self-supervised neural decoding of behavioral choices.

J Neural Eng. 2024 Jul 9;21(3). doi: 10.1088/1741-2552/ad3eb3.

A stochastic variational framework for Recurrent Gaussian Processes models.

Neural Netw. 2019 Apr;112:54-72. doi: 10.1016/j.neunet.2019.01.005. Epub 2019 Feb 1.

Suppr 超能文献

核心技术专利：CN118964589B侵权必究

相似文献

Hierarchical and Self-Attended Sequence Autoencoder.

IEEE Trans Pattern Anal Mach Intell. 2022 Sep;44(9):4975-4986. doi: 10.1109/TPAMI.2021.3068187. Epub 2022 Aug 4.

Learning Hierarchical Variational Autoencoders With Mutual Information Maximization for Autoregressive Sequence Modeling.

IEEE Trans Pattern Anal Mach Intell. 2023 Feb;45(2):1949-1962. doi: 10.1109/TPAMI.2022.3160509. Epub 2023 Jan 6.

Attention Autoencoder for Generative Latent Representational Learning in Anomaly Detection.

Sensors (Basel). 2021 Dec 24;22(1):123. doi: 10.3390/s22010123.

Translating medical image to radiological report: Adaptive multilevel multi-attention approach.

Comput Methods Programs Biomed. 2022 Jun;221:106853. doi: 10.1016/j.cmpb.2022.106853. Epub 2022 May 4.

Improving Chemical Autoencoder Latent Space and Molecular Generation Diversity with Heteroencoders.

Biomolecules. 2018 Oct 30;8(4):131. doi: 10.3390/biom8040131.

DyVGRNN: DYnamic mixture Variational Graph Recurrent Neural Networks.

Neural Netw. 2023 Aug;165:596-610. doi: 10.1016/j.neunet.2023.05.048. Epub 2023 Jun 5.

An informative dual ForkNet for video anomaly detection.

Neural Netw. 2024 Nov;179:106509. doi: 10.1016/j.neunet.2024.106509. Epub 2024 Jul 11.

Deep Latent-Variable Kernel Learning.

IEEE Trans Cybern. 2022 Oct;52(10):10276-10289. doi: 10.1109/TCYB.2021.3062140. Epub 2022 Sep 19.

An LSTM-based adversarial variational autoencoder framework for self-supervised neural decoding of behavioral choices.

J Neural Eng. 2024 Jul 9;21(3). doi: 10.1088/1741-2552/ad3eb3.

A stochastic variational framework for Recurrent Gaussian Processes models.

Neural Netw. 2019 Apr;112:54-72. doi: 10.1016/j.neunet.2019.01.005. Epub 2019 Feb 1.

Hierarchical and Self-Attended Sequence Autoencoder.

出版信息

相似文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献