Badwal Markus W, Bergmann Johanna, Roth Johannes H R, Doeller Christian F, Hebart Martin N
Department of Psychology, Max Planck Institute for Human Cognitive and Brain Sciences, Leipzig 04103, Germany
Vision & Computational Cognition Group, Max Planck Institute for Human Cognitive and Brain Sciences, Leipzig 04103, Germany.
J Neurosci. 2025 Jan 15;45(3):e0936242024. doi: 10.1523/JNEUROSCI.0936-24.2024.
Humans can easily abstract incoming visual information into discrete semantic categories. Previous research employing functional MRI (fMRI) in humans has identified cortical organizing principles that allow not only for coarse-scale distinctions such as animate versus inanimate objects but also more fine-grained distinctions at the level of individual objects. This suggests that fMRI carries rather fine-grained information about individual objects. However, most previous work investigating fine-grained category representations either additionally included coarse-scale category comparisons of objects, which confounds fine-grained and coarse-scale distinctions, or only used a single exemplar of each object, which confounds visual and semantic information. To address these challenges, here we used multisession human fMRI (female and male) paired with a broad yet homogenous stimulus class of 48 terrestrial mammals, with two exemplars per mammal. Multivariate decoding and representational similarity analysis revealed high image-specific reliability in low- and high-level visual regions, indicating stable representational patterns at the image level. In contrast, analyses across exemplars of the same animal yielded only small effects in the lateral occipital complex (LOC), indicating rather subtle category effects in this region. Variance partitioning with a deep neural network and shape model showed that across-exemplar effects in the early visual cortex were largely explained by low-level visual appearance, while representations in LOC appeared to also contain higher category-specific information. These results suggest that representations typically measured with fMRI are dominated by image-specific visual or coarse-grained category information but indicate that commonly employed fMRI protocols may reveal subtle yet reliable distinctions between individual objects.
人类能够轻松地将传入的视觉信息抽象为离散的语义类别。先前在人类中使用功能磁共振成像(fMRI)的研究已经确定了皮层组织原则,这些原则不仅允许进行诸如 animate 与 inanimate 对象之类的粗略区分,还能在单个对象层面进行更细粒度的区分。这表明 fMRI 携带了关于单个对象的相当细粒度的信息。然而,以前大多数研究细粒度类别表征的工作,要么额外包括了对象的粗粒度类别比较,这混淆了细粒度和粗粒度的区分,要么只使用了每个对象的单个示例,这混淆了视觉和语义信息。为了应对这些挑战,我们在此使用了多会话人类 fMRI(男性和女性),并搭配了一个广泛但同质的刺激类别,即 48 种陆生哺乳动物,每种哺乳动物有两个示例。多变量解码和表征相似性分析显示,在低级和高级视觉区域中存在高度的图像特异性可靠性,表明在图像层面存在稳定的表征模式。相比之下,对同一动物的不同示例进行分析,在枕外侧复合体(LOC)中只产生了很小的影响,表明该区域的类别效应相当微妙。使用深度神经网络和形状模型进行方差划分表明,早期视觉皮层中的跨示例效应在很大程度上由低级视觉外观解释,而 LOC 中的表征似乎也包含更高的类别特异性信息。这些结果表明,通常用 fMRI 测量的表征主要由图像特异性视觉或粗粒度类别信息主导,但表明常用的 fMRI 协议可能揭示单个对象之间微妙但可靠的区别。