School of Biomedical Engineering & State Key Laboratory of Advanced Medical Materials and Devices, ShanghaiTech University, Shanghai 201210, China.
School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China; Department of Computer and Information Science, University of Pennsylvania, Philadelphia, PA 19104, USA.
Sci Bull (Beijing). 2024 Jun 15;69(11):1738-1747. doi: 10.1016/j.scib.2024.02.035. Epub 2024 Feb 29.
Cognitive neuroscience aims to develop computational models that can accurately predict and explain neural responses to sensory inputs in the cortex. Recent studies attempt to leverage the representation power of deep neural networks (DNNs) to predict the brain response and suggest a correspondence between artificial and biological neural networks in their feature representations. However, typical voxel-wise encoding models tend to rely on specific networks designed for computer vision tasks, leading to suboptimal brain-wide correspondence during cognitive tasks. To address this challenge, this work proposes a novel approach that upgrades voxel-wise encoding models through multi-level integration of features from DNNs and information from brain networks. Our approach combines DNN feature-level ensemble learning and brain atlas-level model integration, resulting in significant improvements in predicting whole-brain neural activity during naturalistic video perception. Furthermore, this multi-level integration framework enables a deeper understanding of the brain's neural representation mechanism, accurately predicting the neural response to complex visual concepts. We demonstrate that neural encoding models can be optimized by leveraging a framework that integrates both data-driven approaches and theoretical insights into the functional structure of the cortical networks.
认知神经科学旨在开发能够准确预测和解释皮质感觉输入的神经反应的计算模型。最近的研究试图利用深度神经网络 (DNN) 的表示能力来预测大脑反应,并在其特征表示中暗示人工和生物神经网络之间的对应关系。然而,典型的体素编码模型往往依赖于专为计算机视觉任务设计的特定网络,导致认知任务中大脑整体对应关系不佳。为了解决这个挑战,这项工作提出了一种新的方法,通过从 DNN 和大脑网络中提取特征的多层次集成来升级体素编码模型。我们的方法结合了 DNN 特征级别的集成学习和大脑图谱级别的模型集成,在预测自然视频感知过程中的全脑神经活动方面取得了显著的改进。此外,这个多层次的集成框架使我们能够更深入地了解大脑的神经表示机制,准确预测对复杂视觉概念的神经反应。我们证明,通过利用将数据驱动方法和对皮质网络功能结构的理论见解集成到一个框架中,可以优化神经编码模型。