Schuster Viktoria, Krogh Anders
Center for Health Data Science, University of Copenhagen, 2200 Copenhagen, Denmark.
Department of Computer Science, University of Copenhagen, 2100 Copenhagen, Denmark.
Entropy (Basel). 2021 Oct 25;23(11):1403. doi: 10.3390/e23111403.
Autoencoders are commonly used in representation learning. They consist of an encoder and a decoder, which provide a straightforward method to map n-dimensional data in input space to a lower m-dimensional representation space and back. The decoder itself defines an m-dimensional manifold in input space. Inspired by manifold learning, we showed that the decoder can be trained on its own by learning the representations of the training samples along with the decoder weights using gradient descent. A sum-of-squares loss then corresponds to optimizing the manifold to have the smallest Euclidean distance to the training samples, and similarly for other loss functions. We derived expressions for the number of samples needed to specify the encoder and decoder and showed that the decoder generally requires far fewer training samples to be well-specified than the encoder. We discuss the training of autoencoders from this perspective and relate it to previous work in the field that uses noisy training examples and other types of regularization. On the natural image data sets MNIST and CIFAR10, we demonstrated that the decoder is much better suited to learning a low-dimensional representation, especially when trained on small data sets. Using simulated gene regulatory data, we further showed that the decoder alone leads to better generalization and meaningful representations. Our approach of training the decoder alone facilitates representation learning even on small data sets and can lead to improved training of autoencoders. We hope that the simple analyses presented will also contribute to an improved conceptual understanding of representation learning.
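As a concrete illustration of the decoder-only training described above, the sketch below treats each training sample's representation z_i as a free parameter and minimizes the sum-of-squares loss L(theta, {z_i}) = sum_i ||x_i - f_theta(z_i)||^2 jointly over the decoder weights theta and the representations by gradient descent. This is a minimal sketch of the idea, not the authors' implementation; the toy data, network architecture, and hyperparameters below are placeholders chosen for brevity.

```python
# Decoder-only training: the latent codes Z are free parameters
# optimized jointly with the decoder weights by gradient descent.
import torch
import torch.nn as nn

n_samples, input_dim, latent_dim = 1000, 784, 2

# Toy stand-in for a training set (e.g. flattened images).
X = torch.rand(n_samples, input_dim)

# Placeholder decoder architecture mapping latent space to input space.
decoder = nn.Sequential(
    nn.Linear(latent_dim, 128),
    nn.ReLU(),
    nn.Linear(128, input_dim),
)

# One learnable representation per training sample.
Z = nn.Parameter(torch.randn(n_samples, latent_dim) * 0.01)

optimizer = torch.optim.Adam(list(decoder.parameters()) + [Z], lr=1e-3)

for epoch in range(100):
    optimizer.zero_grad()
    reconstruction = decoder(Z)
    # Sum-of-squares loss: fits the decoder manifold to the data by
    # minimizing Euclidean distance to each training sample.
    loss = ((reconstruction - X) ** 2).sum(dim=1).mean()
    loss.backward()
    optimizer.step()
```

In this setup, a representation for a new sample could be obtained by the same latent optimization with the decoder weights frozen, i.e., finding the point on the decoder manifold closest to the sample.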