分层变分自编码器为灵长类大脑中的运动处理提供了一种规范性解释。

Hierarchical VAEs provide a normative account of motion processing in the primate brain.

作者信息

Vafaii Hadi, Yates Jacob L, Butts Daniel A

机构信息

University of Maryland, College Park.

UC Berkeley.

出版信息

bioRxiv. 2023 Nov 5:2023.09.27.559646. doi: 10.1101/2023.09.27.559646.

DOI:10.1101/2023.09.27.559646

PMID:37808629

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10557690/

Abstract

The relationship between perception and inference, as postulated by Helmholtz in the 19th century, is paralleled in modern machine learning by generative models like Variational Autoencoders (VAEs) and their hierarchical variants. Here, we evaluate the role of hierarchical inference and its alignment with brain function in the domain of motion perception. We first introduce a novel synthetic data framework, Retinal Optic Flow Learning (ROFL), which enables control over motion statistics and their causes. We then present a new hierarchical VAE and test it against alternative models on two downstream tasks: (i) predicting ground truth causes of retinal optic flow (e.g., self-motion); and (ii) predicting the responses of neurons in the motion processing pathway of primates. We manipulate the model architectures (hierarchical versus non-hierarchical), loss functions, and the causal structure of the motion stimuli. We find that hierarchical latent structure in the model leads to several improvements. First, it improves the linear decodability of ground truth factors and does so in a sparse and disentangled manner. Second, our hierarchical VAE outperforms previous state-of-the-art models in predicting neuronal responses and exhibits sparse latent-to-neuron relationships. These results depend on the causal structure of the world, indicating that alignment between brains and artificial neural networks depends not only on architecture but also on matching ecologically relevant stimulus statistics. Taken together, our results suggest that hierarchical Bayesian inference underlines the brain's understanding of the world, and hierarchical VAEs can effectively model this understanding.

摘要

19世纪由亥姆霍兹提出的感知与推理之间的关系，在现代机器学习中可由变分自编码器（VAE）及其分层变体等生成模型来类比。在此，我们评估分层推理在运动感知领域中的作用及其与脑功能的一致性。我们首先引入一种新颖的合成数据框架——视网膜光流学习（ROFL），它能够控制运动统计及其成因。然后，我们提出一种新的分层VAE，并在两项下游任务中与其他模型进行测试：（i）预测视网膜光流的真实成因（例如自我运动）；（ii）预测灵长类动物运动处理通路中神经元的反应。我们操纵模型架构（分层与非分层）、损失函数以及运动刺激的因果结构。我们发现模型中的分层潜在结构带来了多项改进。首先，它提高了真实因素的线性可解码性，并且是以稀疏且解缠结的方式实现的。其次，我们的分层VAE在预测神经元反应方面优于先前的最先进模型，并展现出稀疏的潜在与神经元关系。这些结果取决于世界的因果结构，表明大脑与人工神经网络之间的一致性不仅取决于架构，还取决于与生态相关的刺激统计的匹配。综上所述，我们的结果表明分层贝叶斯推理是大脑对世界理解的基础，并且分层VAE能够有效地模拟这种理解。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a210/10627323/bf5b31c1dbe9/nihpp-2023.09.27.559646v2-f0001.jpg

相似文献

Hierarchical VAEs provide a normative account of motion processing in the primate brain.

bioRxiv. 2023 Nov 5:2023.09.27.559646. doi: 10.1101/2023.09.27.559646.

Learning Hierarchical Variational Autoencoders With Mutual Information Maximization for Autoregressive Sequence Modeling.

IEEE Trans Pattern Anal Mach Intell. 2023 Feb;45(2):1949-1962. doi: 10.1109/TPAMI.2022.3160509. Epub 2023 Jan 6.

An Overview of Variational Autoencoders for Source Separation, Finance, and Bio-Signal Applications.

Entropy (Basel). 2021 Dec 28;24(1):55. doi: 10.3390/e24010055.

A multimodal dynamical variational autoencoder for audiovisual speech representation learning.

Neural Netw. 2024 Apr;172:106120. doi: 10.1016/j.neunet.2024.106120. Epub 2024 Jan 11.

ProtWave-VAE: Integrating Autoregressive Sampling with Latent-Based Inference for Data-Driven Protein Design.

ACS Synth Biol. 2023 Dec 15;12(12):3544-3561. doi: 10.1021/acssynbio.3c00261. Epub 2023 Nov 21.

VAEs: Fixing Sample Generation for Regularized VAEs.

Comput Vis ACCV. 2020 Nov-Dec;12625:643-660. doi: 10.1007/978-3-030-69538-5_39. Epub 2021 Feb 25.

Sparse-Coding Variational Autoencoders.

Neural Comput. 2024 Nov 19;36(12):2571-2601. doi: 10.1162/neco_a_01715.

Predicting drug polypharmacology from cell morphology readouts using variational autoencoder latent space arithmetic.

PLoS Comput Biol. 2022 Feb 25;18(2):e1009888. doi: 10.1371/journal.pcbi.1009888. eCollection 2022 Feb.

Conditional Variational Autoencoder for Functional Connectivity Analysis of Autism Spectrum Disorder Functional Magnetic Resonance Imaging Data: A Comparative Study.

Bioengineering (Basel). 2023 Oct 16;10(10):1209. doi: 10.3390/bioengineering10101209.

QARV: Quantization-Aware ResNet VAE for Lossy Image Compression.

IEEE Trans Pattern Anal Mach Intell. 2024 Jan;46(1):436-450. doi: 10.1109/TPAMI.2023.3322904. Epub 2023 Dec 6.

本文引用的文献

High-performing neural network models of visual cortex benefit from high latent dimensionality.

PLoS Comput Biol. 2024 Jan 10;20(1):e1011792. doi: 10.1371/journal.pcbi.1011792. eCollection 2024 Jan.

Generalized Shape Metrics on Neural Representations.

Adv Neural Inf Process Syst. 2021 Dec;34:4738-4750.

Many but not all deep neural network audio models capture brain responses and exhibit correspondence between model stages and brain regions.

PLoS Biol. 2023 Dec 13;21(12):e3002366. doi: 10.1371/journal.pbio.3002366. eCollection 2023 Dec.

Causal inference during closed-loop navigation: parsing of self- and object-motion.

Philos Trans R Soc Lond B Biol Sci. 2023 Sep 25;378(1886):20220344. doi: 10.1098/rstb.2022.0344. Epub 2023 Aug 7.

The neuroconnectionist research programme.

Nat Rev Neurosci. 2023 Jul;24(7):431-450. doi: 10.1038/s41583-023-00705-w. Epub 2023 May 30.

Retinal motion statistics during natural locomotion.

Elife. 2023 May 3;12:e82410. doi: 10.7554/eLife.82410.

Catalyzing next-generation Artificial Intelligence through NeuroAI.

Nat Commun. 2023 Mar 22;14(1):1597. doi: 10.1038/s41467-023-37180-x.

Abstract representations emerge naturally in neural networks trained to perform multiple tasks.

Nat Commun. 2023 Feb 23;14(1):1040. doi: 10.1038/s41467-023-36583-0.

Using artificial neural networks to ask 'why' questions of minds and brains.

Trends Neurosci. 2023 Mar;46(3):240-254. doi: 10.1016/j.tins.2022.12.008. Epub 2023 Jan 17.

Where is the error? Hierarchical predictive coding through dendritic error computation.

Trends Neurosci. 2023 Jan;46(1):45-59. doi: 10.1016/j.tins.2022.09.007. Epub 2022 Nov 18.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

分层变分自编码器为灵长类大脑中的运动处理提供了一种规范性解释。

Hierarchical VAEs provide a normative account of motion processing in the primate brain.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

本文引用的文献