Tausani Lorenzo, Testolin Alberto, Zorzi Marco
Department of General Psychology and Padova Neuroscience Center, University of Padova, Padova, Italy.
Department of Mathematics, University of Padova, Padova, Italy.
Sci Rep. 2025 Jan 22;15(1):2875. doi: 10.1038/s41598-024-85055-y.
Hierarchical generative models can produce data samples based on the statistical structure of their training distribution. This capability can be linked to current theories in computational neuroscience, which propose that spontaneous brain activity at rest is the manifestation of top-down dynamics of generative models detached from action-perception cycles. A popular class of hierarchical generative models is that of Deep Belief Networks (DBNs), which are energy-based deep learning architectures that can learn multiple levels of representations in a completely unsupervised way by exploiting Hebbian-like learning mechanisms. In this work, we study the generative dynamics of a recent extension of the DBN, the iterative DBN (iDBN), which more faithfully simulates neurocognitive development by jointly tuning the connection weights across all layers of the hierarchy. We characterize the number of states visited during top-down sampling and investigate whether the heterogeneity of visited attractors can be increased by initiating the generation process from biased hidden states. To this end, we train iDBN models on well-known datasets containing handwritten digits and pictures of human faces, and show that the ability to generate diverse data prototypes can be enhanced by initializing top-down sampling from "chimera states", which represent high-level features combining multiple abstract representations of the sensory data. Although the models are not always able to transition between all potential target states within a single generation trajectory, the iDBN shows richer top-down dynamics in comparison to a shallow generative model (a single-layer Restricted Boltzmann Machine). We further show that the generated samples can be used to support continual learning through generative replay mechanisms.
Our findings suggest that the top-down dynamics of hierarchical generative models are significantly influenced by the shape of the energy function, which depends both on the depth of the processing architecture and on the statistical structure of the sensory data.
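The top-down generation described in the abstract relies on alternating (block) Gibbs sampling between layers of binary stochastic units. A minimal sketch for a single binary RBM layer, using toy dimensions and random untrained weights, is given below; all names and sizes are illustrative assumptions, not taken from the paper's models:

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

# Hypothetical toy dimensions: 16 visible and 8 hidden binary units.
n_visible, n_hidden = 16, 8
W = rng.normal(0.0, 0.1, size=(n_visible, n_hidden))  # connection weights
b_v = np.zeros(n_visible)  # visible biases
b_h = np.zeros(n_hidden)   # hidden biases

def sample_hidden(v):
    """Bottom-up step: sample binary hidden units given a visible state."""
    p = sigmoid(v @ W + b_h)
    return (rng.random(p.shape) < p).astype(float)

def sample_visible(h):
    """Top-down step: sample binary visible units given a hidden state."""
    p = sigmoid(h @ W.T + b_v)
    return (rng.random(p.shape) < p).astype(float)

def gibbs_chain(h0, steps=50):
    """Run an alternating Gibbs chain started from a hidden state h0,
    returning the final visible and hidden configurations."""
    h = h0
    for _ in range(steps):
        v = sample_visible(h)
        h = sample_hidden(v)
    return v, h

# Initiate generation from a biased hidden state (here: random bits),
# loosely analogous to seeding top-down sampling from a chosen state.
h0 = (rng.random(n_hidden) < 0.5).astype(float)
v, h = gibbs_chain(h0)
```

In a trained model the chain would wander between the attractors shaped by learning; with random weights, as here, it merely illustrates the sampling mechanics.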