Disentangled deep generative models reveal coding principles of the human face processing network.

Author information

Department of Cognitive Science, Johns Hopkins University, Baltimore, Maryland, United States of America.

Publication information

PLoS Comput Biol. 2024 Feb 26;20(2):e1011887. doi: 10.1371/journal.pcbi.1011887. eCollection 2024 Feb.

DOI:10.1371/journal.pcbi.1011887

PMID:38408105

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10919870/

Abstract

Despite decades of research, much is still unknown about the computations carried out in the human face processing network. Recently, deep networks have been proposed as a computational account of human visual processing, but while they provide a good match to neural data throughout visual cortex, they lack interpretability. We introduce a method for interpreting brain activity using a new class of deep generative models, disentangled representation learning models, which learn a low-dimensional latent space that "disentangles" different semantically meaningful dimensions of faces, such as rotation, lighting, or hairstyle, in an unsupervised manner by enforcing statistical independence between dimensions. We find that the majority of our model's learned latent dimensions are interpretable by human raters. Further, these latent dimensions serve as a good encoding model for human fMRI data. We next investigate the representation of different latent dimensions across face-selective voxels. We find that low- and high-level face features are represented in posterior and anterior face-selective regions, respectively, corroborating prior models of human face recognition. Interestingly, though, we find identity-relevant and irrelevant face features across the face processing network. Finally, we provide new insight into the few "entangled" (uninterpretable) dimensions in our model by showing that they match responses in the ventral stream and carry information about facial identity. Disentangled face encoding models provide an exciting alternative to standard "black box" deep learning approaches for modeling and interpreting human brain data.
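
The abstract reports that the learned latent dimensions serve as an encoding model for fMRI data, but it does not specify the regression procedure. As a minimal sketch, assuming a linear ridge regression from latent values to voxel responses and purely synthetic data (the function names, array sizes, noise level, and `alpha` are illustrative choices, not taken from the paper), the idea might look like this:

```python
import numpy as np


def fit_ridge(X, Y, alpha=1.0):
    """Closed-form ridge regression: W = (X^T X + alpha * I)^-1 X^T Y."""
    n_features = X.shape[1]
    gram = X.T @ X + alpha * np.eye(n_features)
    return np.linalg.solve(gram, X.T @ Y)


def voxelwise_correlation(Y_true, Y_pred):
    """Pearson correlation between measured and predicted responses, per voxel."""
    yt = Y_true - Y_true.mean(axis=0)
    yp = Y_pred - Y_pred.mean(axis=0)
    denom = np.linalg.norm(yt, axis=0) * np.linalg.norm(yp, axis=0)
    return (yt * yp).sum(axis=0) / denom


def demo(seed=0):
    """Fit on synthetic 'latents' and 'voxels'; all sizes are arbitrary."""
    rng = np.random.default_rng(seed)
    n_train, n_test, n_latents, n_voxels = 200, 50, 24, 10

    # Hypothetical latent values for each stimulus (stand-in for model output).
    Z = rng.standard_normal((n_train + n_test, n_latents))
    # Simulated voxel responses: a linear mix of latents plus Gaussian noise.
    W_true = rng.standard_normal((n_latents, n_voxels))
    Y = Z @ W_true + 0.5 * rng.standard_normal((n_train + n_test, n_voxels))

    # Fit on the training stimuli, then score predictions on held-out stimuli.
    W_hat = fit_ridge(Z[:n_train], Y[:n_train], alpha=1.0)
    r = voxelwise_correlation(Y[n_train:], Z[n_train:] @ W_hat)
    return W_hat, r


if __name__ == "__main__":
    W_hat, r = demo()
    print("weights:", W_hat.shape, "mean held-out r:", float(r.mean()))
```

In a sketch like this, each row of the fitted weight matrix corresponds to one latent dimension, so the weights for a given dimension can be compared across voxels or regions. That is one plausible way to ask where a feature such as rotation or lighting is represented, though the authors' actual analysis may differ.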
