Shen Guohua, Dwivedi Kshitij, Majima Kei, Horikawa Tomoyasu, Kamitani Yukiyasu
Computational Neuroscience Laboratories, Advanced Telecommunications Research Institute International, Kyoto, Japan.
Graduate School of Informatics, Kyoto University, Kyoto, Japan.
Front Comput Neurosci. 2019 Apr 12;13:21. doi: 10.3389/fncom.2019.00021. eCollection 2019.
Deep neural networks (DNNs) have recently been applied successfully to brain decoding and image reconstruction from functional magnetic resonance imaging (fMRI) activity. However, direct training of a DNN with fMRI data is often avoided because the size of available data is thought to be insufficient for training a complex network with numerous parameters. Instead, a pre-trained DNN usually serves as a proxy for hierarchical visual representations, and fMRI data are used to decode individual DNN features of a stimulus image using a simple linear model, which are then passed to a reconstruction module. Here, we directly trained a DNN model with fMRI data and the corresponding stimulus images to build an end-to-end reconstruction model. We accomplished this by training a generative adversarial network with an additional loss term that was defined in high-level feature space (feature loss) using up to 6,000 training data samples (natural images and fMRI responses). The above model was tested on independent datasets and directly reconstructed image using an fMRI pattern as the input. Reconstructions obtained from our proposed method resembled the test stimuli (natural and artificial images) and reconstruction accuracy increased as a function of training-data size. Ablation analyses indicated that the feature loss that we employed played a critical role in achieving accurate reconstruction. Our results show that the end-to-end model can learn a direct mapping between brain activity and perception.
深度神经网络(DNN)最近已成功应用于基于功能磁共振成像(fMRI)活动的脑解码和图像重建。然而,通常避免直接使用fMRI数据训练DNN,因为人们认为可用数据的大小不足以训练具有众多参数的复杂网络。相反,预训练的DNN通常用作分层视觉表征的代理,并且fMRI数据用于使用简单线性模型解码刺激图像的各个DNN特征,然后将这些特征传递到重建模块。在这里,我们直接使用fMRI数据和相应的刺激图像训练DNN模型,以构建端到端的重建模型。我们通过训练一个生成对抗网络来实现这一点,该网络带有一个额外的损失项,该项在高级特征空间(特征损失)中定义,使用了多达6000个训练数据样本(自然图像和fMRI响应)。上述模型在独立数据集上进行了测试,并使用fMRI模式作为输入直接重建图像。从我们提出的方法获得的重建结果类似于测试刺激(自然图像和人工图像),并且重建精度随着训练数据大小的增加而提高。消融分析表明,我们采用的特征损失在实现准确重建中起着关键作用。我们的结果表明,端到端模型可以学习大脑活动与感知之间的直接映射。