特征加权感受野：复杂特征空间的可解释编码模型。

The feature-weighted receptive field: an interpretable encoding model for complex feature spaces.

机构信息

Medical University of South Carolina, Charleston, SC, USA.

出版信息

Neuroimage. 2018 Oct 15;180(Pt A):188-202. doi: 10.1016/j.neuroimage.2017.06.035. Epub 2017 Jun 20.

DOI:10.1016/j.neuroimage.2017.06.035

PMID:28645845

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5886832/

Abstract

We introduce the feature-weighted receptive field (fwRF), an encoding model designed to balance expressiveness, interpretability and scalability. The fwRF is organized around the notion of a feature map-a transformation of visual stimuli into visual features that preserves the topology of visual space (but not necessarily the native resolution of the stimulus). The key assumption of the fwRF model is that activity in each voxel encodes variation in a spatially localized region across multiple feature maps. This region is fixed for all feature maps; however, the contribution of each feature map to voxel activity is weighted. Thus, the model has two separable sets of parameters: "where" parameters that characterize the location and extent of pooling over visual features, and "what" parameters that characterize tuning to visual features. The "where" parameters are analogous to classical receptive fields, while "what" parameters are analogous to classical tuning functions. By treating these as separable parameters, the fwRF model complexity is independent of the resolution of the underlying feature maps. This makes it possible to estimate models with thousands of high-resolution feature maps from relatively small amounts of data. Once a fwRF model has been estimated from data, spatial pooling and feature tuning can be read-off directly with no (or very little) additional post-processing or in-silico experimentation. We describe an optimization algorithm for estimating fwRF models from data acquired during standard visual neuroimaging experiments. We then demonstrate the model's application to two distinct sets of features: Gabor wavelets and features supplied by a deep convolutional neural network. We show that when Gabor feature maps are used, the fwRF model recovers receptive fields and spatial frequency tuning functions consistent with known organizational principles of the visual cortex. We also show that a fwRF model can be used to regress entire deep convolutional networks against brain activity. The ability to use whole networks in a single encoding model yields state-of-the-art prediction accuracy. Our results suggest a wide variety of uses for the feature-weighted receptive field model, from retinotopic mapping with natural scenes, to regressing the activities of whole deep neural networks onto measured brain activity.

摘要

我们介绍了特征加权感受野（fwRF），这是一种旨在平衡表达能力、可解释性和可扩展性的编码模型。fwRF 的组织围绕着特征图的概念——将视觉刺激转换为保留视觉空间拓扑结构（但不一定保留刺激的原始分辨率）的视觉特征。fwRF 模型的关键假设是，每个体素的活动编码了多个特征图中空间局部区域的变化。该区域对于所有特征图都是固定的；但是，每个特征图对体素活动的贡献是加权的。因此，该模型有两个可分离的参数集：“在哪里”参数，用于描述在视觉特征上进行池化的位置和范围；“是什么”参数，用于描述对视觉特征的调谐。“在哪里”参数类似于经典感受野，而“是什么”参数类似于经典调谐函数。通过将这些参数视为可分离的参数，fwRF 模型的复杂度与基础特征图的分辨率无关。这使得可以从小量数据中估计具有数千个高分辨率特征图的模型。一旦从数据中估计了 fwRF 模型，就可以直接读取空间池化和特征调谐，而无需（或很少）进行额外的后处理或计算机实验。我们描述了一种从标准视觉神经影像学实验中获取的数据中估计 fwRF 模型的优化算法。然后，我们展示了该模型在两组不同特征中的应用：Gabor 小波和由深度卷积神经网络提供的特征。我们表明，当使用 Gabor 特征图时，fwRF 模型恢复的感受野和空间频率调谐函数与视觉皮层的已知组织原则一致。我们还表明，fwRF 模型可用于将整个深度卷积网络回归到大脑活动。在单个编码模型中使用整个网络的能力可产生最先进的预测精度。我们的结果表明，特征加权感受野模型具有广泛的用途，从使用自然场景的视网膜映射，到将整个深度神经网络的活动回归到测量的大脑活动。

相似文献

The feature-weighted receptive field: an interpretable encoding model for complex feature spaces.特征加权感受野：复杂特征空间的可解释编码模型。

Neuroimage. 2018 Oct 15;180(Pt A):188-202. doi: 10.1016/j.neuroimage.2017.06.035. Epub 2017 Jun 20.

Large-scale parameters framework with large convolutional kernel for encoding visual fMRI activity information.基于大卷积核的大规模参数框架，用于编码视觉 fMRI 活动信息。

Cereb Cortex. 2024 Jul 3;34(7). doi: 10.1093/cercor/bhae257.

Stacked regressions and structured variance partitioning for interpretable brain maps.堆叠回归和结构方差分解可用于可解释的脑图谱。

Neuroimage. 2024 Sep;298:120772. doi: 10.1016/j.neuroimage.2024.120772. Epub 2024 Aug 6.

Sensitivity and specificity considerations for fMRI encoding, decoding, and mapping of auditory cortex at ultra-high field.超高场 fMRI 编码、解码和听觉皮层映射的灵敏度和特异性考虑因素。

Neuroimage. 2018 Jan 1;164:18-31. doi: 10.1016/j.neuroimage.2017.03.063. Epub 2017 Mar 31.

A new method for estimating population receptive field topography in visual cortex.一种估计视觉皮层中群体感受野地形的新方法。

Neuroimage. 2013 Nov 1;81:144-157. doi: 10.1016/j.neuroimage.2013.05.026. Epub 2013 May 16.

A visual encoding model based on deep neural networks and transfer learning for brain activity measured by functional magnetic resonance imaging.基于深度神经网络和迁移学习的功能磁共振成像脑活动视觉编码模型。

J Neurosci Methods. 2019 Sep 1;325:108318. doi: 10.1016/j.jneumeth.2019.108318. Epub 2019 Jun 27.

Estimating receptive fields of simple and complex cells in early visual cortex: A convolutional neural network model with parameterized rectification.估计早期视觉皮层中简单和复杂细胞的感受野：具有参数化修正的卷积神经网络模型。

PLoS Comput Biol. 2024 May 31;20(5):e1012127. doi: 10.1371/journal.pcbi.1012127. eCollection 2024 May.

Fixed versus mixed RSA: Explaining visual representations by fixed and mixed feature sets from shallow and deep computational models.固定与混合随机语义分析：用浅层和深层计算模型中的固定和混合特征集解释视觉表征

J Math Psychol. 2017 Feb;76(Pt B):184-197. doi: 10.1016/j.jmp.2016.10.007.

Voxel-to-voxel predictive models reveal unexpected structure in unexplained variance.体素间预测模型揭示了无法解释的变异中的意外结构。

Neuroimage. 2021 Sep;238:118266. doi: 10.1016/j.neuroimage.2021.118266. Epub 2021 Jun 12.

Bayesian population receptive field modelling.贝叶斯群体感受野建模。

Neuroimage. 2018 Oct 15;180(Pt A):173-187. doi: 10.1016/j.neuroimage.2017.09.008. Epub 2017 Sep 8.

引用本文的文献

The Voxelwise Encoding Model framework: A tutorial introduction to fitting encoding models to fMRI data.体素编码模型框架：将编码模型拟合到功能磁共振成像数据的教程介绍。

Imaging Neurosci (Camb). 2025 May 9;3. doi: 10.1162/imag_a_00575. eCollection 2025.

In silico discovery of representational relationships across visual cortex.视觉皮层中代表性关系的计算机模拟发现。

Nat Hum Behav. 2025 Jun 25. doi: 10.1038/s41562-025-02252-z.

Compression-enabled interpretability of voxelwise encoding models.基于体素编码模型的压缩增强可解释性

PLoS Comput Biol. 2025 Feb 19;21(2):e1012822. doi: 10.1371/journal.pcbi.1012822. eCollection 2025 Feb.

A large-scale examination of inductive biases shaping high-level visual representation in brains and machines.大规模考察在大脑和机器中塑造高级视觉表示的归纳偏差。

Nat Commun. 2024 Oct 30;15(1):9383. doi: 10.1038/s41467-024-53147-y.

Stacked regressions and structured variance partitioning for interpretable brain maps.堆叠回归和结构方差分解可用于可解释的脑图谱。

Neuroimage. 2024 Sep;298:120772. doi: 10.1016/j.neuroimage.2024.120772. Epub 2024 Aug 6.

PLoS Comput Biol. 2024 May 31;20(5):e1012127. doi: 10.1371/journal.pcbi.1012127. eCollection 2024 May.

Uncovering the Role of the Early Visual Cortex in Visual Mental Imagery.揭示早期视觉皮层在视觉心理意象中的作用。

Vision (Basel). 2024 May 2;8(2):29. doi: 10.3390/vision8020029.

The cortical representation of language timescales is shared between reading and listening.语言的皮质代表时间尺度在阅读和听力之间是共享的。

Commun Biol. 2024 Mar 7;7(1):284. doi: 10.1038/s42003-024-05909-z.

Human brain responses are modulated when exposed to optimized natural images or synthetically generated images.当人暴露在优化的自然图像或合成生成的图像下时，大脑的反应会被调节。

Commun Biol. 2023 Oct 23;6(1):1076. doi: 10.1038/s42003-023-05440-7.

The Cortical Representation of Language Timescales is Shared between Reading and Listening.语言时间尺度的皮层表征在阅读和听力之间是共享的。

bioRxiv. 2023 Dec 11:2023.01.06.522601. doi: 10.1101/2023.01.06.522601.

本文引用的文献

Deep Neural Networks: A New Framework for Modeling Biological Vision and Brain Information Processing.深度神经网络：一种用于模拟生物视觉和大脑信息处理的新框架。

Annu Rev Vis Sci. 2015 Nov 24;1:417-446. doi: 10.1146/annurev-vision-082114-035447.

Seeing it all: Convolutional network layers map the function of the human visual system.尽收眼底：卷积神经网络层映射出人类视觉系统的功能。

Neuroimage. 2017 May 15;152:184-194. doi: 10.1016/j.neuroimage.2016.10.001. Epub 2016 Oct 21.

Resolving Ambiguities of MVPA Using Explicit Models of Representation.使用显式表征模型解决多体素模式分析的模糊性

Trends Cogn Sci. 2015 Oct;19(10):551-554. doi: 10.1016/j.tics.2015.07.005.

Deep Neural Networks Reveal a Gradient in the Complexity of Neural Representations across the Ventral Stream.深度神经网络揭示了腹侧流中神经表征复杂性的梯度变化。

J Neurosci. 2015 Jul 8;35(27):10005-14. doi: 10.1523/JNEUROSCI.5023-14.2015.

Deep learning.深度学习。

Nature. 2015 May 28;521(7553):436-44. doi: 10.1038/nature14539.

Natural scene statistics account for the representation of scene categories in human visual cortex.自然场景统计数据解释了人类视觉皮层中场景类别的表示。

Neuron. 2013 Sep 4;79(5):1025-34. doi: 10.1016/j.neuron.2013.06.034. Epub 2013 Aug 8.

Cortical representation of animate and inanimate objects in complex natural scenes.复杂自然场景中 animate 和 inanimate 对象的皮层表征。注：animate 可译为“有生命的” ，inanimate 可译为“无生命的” 。这里直接保留英文以便更准确传达原文特定术语含义。

J Physiol Paris. 2012 Sep-Dec;106(5-6):239-49. doi: 10.1016/j.jphysparis.2012.02.001. Epub 2012 Mar 28.

Reconstructing visual experiences from brain activity evoked by natural movies.从自然电影诱发的大脑活动中重建视觉体验。

Curr Biol. 2011 Oct 11;21(19):1641-6. doi: 10.1016/j.cub.2011.08.031. Epub 2011 Sep 22.

Encoding and decoding in fMRI.功能磁共振成像中的编码和解码。

Neuroimage. 2011 May 15;56(2):400-10. doi: 10.1016/j.neuroimage.2010.07.073. Epub 2010 Aug 4.

Bayesian reconstruction of natural images from human brain activity.基于人类大脑活动的自然图像贝叶斯重建。

Neuron. 2009 Sep 24;63(6):902-15. doi: 10.1016/j.neuron.2009.09.006.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验