Department of Psychology, University of Cambridge, Downing Street, Cambridge, CB2 3EB, United Kingdom.
Institute of Electronics, Communications & Information Technology, Queen's University, Belfast, UK.
Sci Rep. 2018 Jul 13;8(1):10636. doi: 10.1038/s41598-018-28865-1.
Recognising an object involves rapid visual processing and activation of semantic knowledge about the object, but how visual processing activates and interacts with semantic representations remains unclear. Cognitive neuroscience research has shown that while visual processing involves posterior regions along the ventral stream, object meaning involves more anterior regions, especially perirhinal cortex. Here we investigate visuo-semantic processing by combining a deep neural network model of vision with an attractor network model of semantics, such that visual information maps onto object meanings represented as activation patterns across features. In the combined model, concept activation is driven by visual input and co-occurrence of semantic features, consistent with neurocognitive accounts. We tested the model's ability to explain fMRI data where participants named objects. Visual layers explained activation patterns in early visual cortex, whereas pattern-information in perirhinal cortex was best explained by later stages of the attractor network, when detailed semantic representations are activated. Posterior ventral temporal cortex was best explained by intermediate stages corresponding to initial semantic processing, when visual information has the greatest influence on the emerging semantic representation. These results provide proof of principle of how a mechanistic model of combined visuo-semantic processing can account for pattern-information in the ventral stream.
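The combined architecture described above can be illustrated with a toy sketch. This is not the authors' implementation: the layer sizes, the random vectors standing in for deep-network activations, the Hebbian co-occurrence weights, the tanh update rule, and the names `settle`, `w_in`, and `w_rec` are all assumptions made for illustration. It shows only the general idea that a fixed visual drive and recurrent feature co-occurrence jointly shape a semantic state that develops over successive time steps.

```python
import numpy as np


def settle(visual_input, w_in, w_rec, n_steps=10, rate=0.3):
    """Return the semantic state after each update step (hypothetical dynamics)."""
    state = np.zeros(w_rec.shape[0])
    drive = w_in @ visual_input  # feedforward visual drive, constant over time
    trajectory = []
    for _ in range(n_steps):
        # Each unit moves toward a target set by visual drive plus recurrent input.
        target = np.tanh(drive + w_rec @ state)
        state = (1 - rate) * state + rate * target
        trajectory.append(state.copy())
    return np.array(trajectory)


rng = np.random.default_rng(0)
n_concepts, n_visual, n_semantic = 5, 20, 30

# Assumption: each concept is a +/-1 pattern over semantic features.
concepts = rng.choice([-1.0, 1.0], size=(n_concepts, n_semantic))
# Assumption: random vectors stand in for late-layer deep-network activations.
visual = rng.normal(size=(n_concepts, n_visual))

# Recurrent weights from feature co-occurrence across concepts (Hebbian outer product).
w_rec = concepts.T @ concepts / n_semantic
np.fill_diagonal(w_rec, 0.0)

# Least-squares map from visual vectors to a weak copy of each semantic pattern.
w_in = 0.5 * (np.linalg.pinv(visual) @ concepts).T

trajectory = settle(visual[0], w_in, w_rec)

# Cosine similarity between the evolving state and the target concept, per step.
target = concepts[0]
similarity = trajectory @ target / (
    np.linalg.norm(trajectory, axis=1) * np.linalg.norm(target)
)
print(np.round(similarity, 3))
```

Each row of `trajectory` is the semantic state at one time step. In a sketch like this, early rows are dominated by the visual drive and later rows increasingly reflect the recurrent co-occurrence structure, which loosely parallels the distinction the abstract draws between intermediate and later stages of the attractor network.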