Neuroscience Institute, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213
Department of Psychology.
J Neurosci. 2023 May 31;43(22):4144-4161. doi: 10.1523/JNEUROSCI.1822-22.2023. Epub 2023 May 1.
Midlevel features, such as contour and texture, provide a computational link between low- and high-level visual representations. Although the nature of midlevel representations in the brain is not fully understood, past work has suggested a texture statistics model, called the P-S model (Portilla and Simoncelli, 2000), is a candidate for predicting neural responses in areas V1-V4 as well as human behavioral data. However, it is not currently known how well this model accounts for the responses of higher visual cortex to natural scene images. To examine this, we constructed single-voxel encoding models based on P-S statistics and fit the models to fMRI data from human subjects (both sexes) from the Natural Scenes Dataset (Allen et al., 2022). We demonstrate that the texture statistics encoding model can predict the held-out responses of individual voxels in early retinotopic areas and higher-level category-selective areas. The ability of the model to reliably predict signal in higher visual cortex suggests that the representation of texture statistics features is widespread throughout the brain. Furthermore, using variance partitioning analyses, we identify which features are most uniquely predictive of brain responses and show that the contributions of higher-order texture features increase from early areas to higher areas on the ventral and lateral surfaces. We also demonstrate that patterns of sensitivity to texture statistics can be used to recover broad organizational axes within visual cortex, including dimensions that capture semantic image content. These results provide a key step forward in characterizing how midlevel feature representations emerge hierarchically across the visual system. Intermediate visual features, like texture, play an important role in cortical computations and may contribute to tasks like object and scene recognition. Here, we used a texture model proposed in past work to construct encoding models that predict the responses of neural populations in human visual cortex (measured with fMRI) to natural scene stimuli. We show that responses of neural populations at multiple levels of the visual system can be predicted by this model, and that the model is able to reveal an increase in the complexity of feature representations from early retinotopic cortex to higher areas of ventral and lateral visual cortex. These results support the idea that texture-like representations may play a broad underlying role in visual processing.
中层特征,如轮廓和纹理,为低水平和高水平视觉表示之间提供了计算联系。尽管大脑中层表示的性质尚未完全理解,但过去的工作表明,一种纹理统计模型,称为 P-S 模型(Portilla 和 Simoncelli,2000),是预测 V1-V4 区域以及人类行为数据中神经反应的候选模型。然而,目前尚不清楚该模型在多大程度上解释了更高视觉皮层对自然场景图像的反应。为了研究这一点,我们基于 P-S 统计构建了单像素编码模型,并将模型拟合到来自自然场景数据集(Allen 等人,2022 年)的人类受试者(男女)的 fMRI 数据中。我们证明,纹理统计编码模型可以预测早期视域区域和高级类别选择区域中单个体素的保留反应。该模型能够可靠地预测高级视觉皮层的信号,这表明纹理统计特征的表示在整个大脑中广泛存在。此外,使用方差分解分析,我们确定了哪些特征对大脑反应最具独特预测性,并表明高阶纹理特征的贡献从早期区域到腹侧和外侧表面的较高区域逐渐增加。我们还证明,对纹理统计的敏感性模式可用于恢复视觉皮层内的广泛组织轴,包括捕获语义图像内容的维度。这些结果在描述中层特征表示如何在整个视觉系统中分层出现方面迈出了关键的一步。中间视觉特征,如纹理,在皮质计算中起着重要作用,并且可能有助于对象和场景识别等任务。在这里,我们使用过去工作中提出的纹理模型来构建编码模型,该模型可以预测自然场景刺激对人类视觉皮层(通过 fMRI 测量)中神经群体的反应。我们表明,该模型可以预测多个视觉系统水平的神经群体的反应,并且该模型能够揭示从早期视域皮层到腹侧和外侧视觉皮层的较高区域,特征表示的复杂性增加。这些结果支持这样的观点,即类似纹理的表示可能在视觉处理中起着广泛的基础作用。