University of Washington Center for Computational Neuroscience and Swartz Center for Theoretical Neuroscience, Seattle, WA, USA.
Department of Applied Mathematics, University of Washington, Seattle, WA, USA.
Nat Commun. 2021 Mar 3;12(1):1417. doi: 10.1038/s41467-021-21696-1.
Artificial neural networks have recently achieved many successes in solving sequential processing and planning tasks. Their success is often ascribed to the emergence of the task's low-dimensional latent structure in the network activity, i.e., in the learned neural representations. Here, we investigate the hypothesis that one means of generating representations with easily accessed low-dimensional latent structure, possibly reflecting an underlying semantic organization, is learning to predict observations about the world. Specifically, we ask whether and when the network mechanisms for sensory prediction coincide with those for extracting the underlying latent variables. Using a recurrent neural network model trained to predict a sequence of observations, we show that the network dynamics exhibit low-dimensional but nonlinearly transformed representations of sensory inputs that map the latent structure of the sensory environment. We quantify these results using nonlinear measures of intrinsic dimensionality and linear decodability of the latent variables, and we provide mathematical arguments for why such useful predictive representations emerge. Throughout, we focus on how our results can aid the analysis and interpretation of experimental data.
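To make the setup concrete, here is a minimal sketch (not the authors' code) of the kind of predictive-learning experiment the abstract describes: a recurrent network trained to predict the next observation in a sequence. The toy environment below, a one-dimensional latent variable drifting on a ring and observed through a fixed random nonlinear expansion, is an illustrative assumption; all sizes and names (`obs_dim`, `hidden_dim`, `Predictor`) are hypothetical.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
latent_dim, obs_dim, hidden_dim = 1, 50, 128  # illustrative sizes, not the paper's

# Fixed random map from the 2-D ring embedding to high-dimensional observations.
W = torch.randn(2, obs_dim) / obs_dim ** 0.5

def make_batch(batch, steps):
    # Latent variable: a slow random walk of an angle theta (the "latent structure").
    theta = torch.cumsum(0.1 * torch.randn(batch, steps, latent_dim), dim=1)
    ring = torch.cat([torch.cos(theta), torch.sin(theta)], dim=-1)
    obs = torch.tanh(ring @ W)  # observations: a nonlinear function of the latent
    return obs, theta

class Predictor(nn.Module):
    def __init__(self):
        super().__init__()
        self.rnn = nn.GRU(obs_dim, hidden_dim, batch_first=True)
        self.readout = nn.Linear(hidden_dim, obs_dim)

    def forward(self, obs):
        h, _ = self.rnn(obs)       # hidden states: the learned representation
        return self.readout(h), h  # predicted next observation, hidden states

model = Predictor()
opt = torch.optim.Adam(model.parameters(), lr=1e-3)
for step in range(2000):
    obs, _ = make_batch(batch=32, steps=100)
    pred, _ = model(obs[:, :-1])              # predict o_{t+1} from o_{<=t}
    loss = ((pred - obs[:, 1:]) ** 2).mean()  # next-step prediction error
    opt.zero_grad()
    loss.backward()
    opt.step()
```

The only training signal is next-observation prediction; the latent variable theta is never shown to the network, which is what makes any latent structure found in the hidden states an emergent property.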
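Continuing the sketch above, the two quantifications the abstract names could look like the following: linear decodability of the latent variable from the hidden states, here via ridge regression, and a nonlinear intrinsic-dimensionality estimate, here the Two-NN estimator of Facco et al., which is one standard choice; the paper's exact measures may differ.

```python
import numpy as np
from sklearn.linear_model import Ridge
from sklearn.metrics import r2_score
from sklearn.neighbors import NearestNeighbors

# Collect hidden states and the true latent variable from the trained model.
with torch.no_grad():
    obs, theta = make_batch(batch=64, steps=100)
    _, h = model(obs)
H = h.reshape(-1, hidden_dim).numpy()
z = theta.reshape(-1).numpy()
target = np.stack([np.cos(z), np.sin(z)], axis=1)  # latent position on the ring

# (i) Linear decodability: held-out R^2 of a ridge decoder for the latent.
n = len(H) // 2
decoder = Ridge(alpha=1.0).fit(H[:n], target[:n])
print("decoding R^2:", r2_score(target[n:], decoder.predict(H[n:])))

# (ii) Two-NN intrinsic dimensionality: from the ratio mu of each point's
# second- to first-nearest-neighbor distance, d ~= 1 / mean(log mu).
dists, _ = NearestNeighbors(n_neighbors=3).fit(H).kneighbors(H)
mu = dists[:, 2] / dists[:, 1]  # column 0 is the point itself
print("Two-NN intrinsic dimension:", 1.0 / np.mean(np.log(mu)))
```

Under this toy setup, high decoding R^2 with an intrinsic dimension near 1 would be the signature the abstract points to: a representation that is nonlinearly transformed yet low-dimensional and linearly readable in the latent variable.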