School of Electrical and Computer Engineering, Purdue University, West Lafayette, IN, USA; Purdue Institute for Integrative Neuroscience, Purdue University, West Lafayette, IN, USA.
Center for Magnetic Resonance Research, Department of Radiology, University of Minnesota Medical School, Minneapolis, MN, USA.
Neuroimage. 2018 Aug 1;176:152-163. doi: 10.1016/j.neuroimage.2018.04.053. Epub 2018 Apr 27.
Recent studies have shown the value of using deep learning models for mapping and characterizing how the brain represents and organizes information for natural vision. However, modeling the relationship between deep learning models and the brain (or encoding models), requires measuring cortical responses to large and diverse sets of natural visual stimuli from single subjects. This requirement limits prior studies to few subjects, making it difficult to generalize findings across subjects or for a population. In this study, we developed new methods to transfer and generalize encoding models across subjects. To train encoding models specific to a target subject, the models trained for other subjects were used as the prior models and were refined efficiently using Bayesian inference with a limited amount of data from the target subject. To train encoding models for a population, the models were progressively trained and updated with incremental data from different subjects. For the proof of principle, we applied these methods to functional magnetic resonance imaging (fMRI) data from three subjects watching tens of hours of naturalistic videos, while a deep residual neural network driven by image recognition was used to model visual cortical processing. Results demonstrate that the methods developed herein provide an efficient and effective strategy to establish both subject-specific and population-wide predictive models of cortical representations of high-dimensional and hierarchical visual features.
最近的研究表明,使用深度学习模型来映射和描述大脑如何表示和组织自然视觉信息具有重要价值。然而,要构建深度学习模型与大脑(或编码模型)之间的关系,需要从单个主体中测量对大量不同自然视觉刺激的皮质反应。这一要求限制了先前的研究只能涉及少数主体,难以在主体间或人群中推广发现。在这项研究中,我们开发了新的方法来在主体间转移和推广编码模型。为了针对目标主体训练特定的编码模型,我们使用来自其他主体的模型作为先验模型,并使用贝叶斯推断对目标主体的少量数据进行高效细化。为了训练人群的编码模型,我们使用不同主体的增量数据逐步训练和更新模型。作为原理验证,我们将这些方法应用于三个主体观看数十小时自然视频的功能磁共振成像 (fMRI) 数据,同时使用受图像识别驱动的深度残差神经网络来模拟视觉皮质处理。结果表明,本文所开发的方法提供了一种高效有效的策略,可以建立针对高维和分层视觉特征的皮质表示的主体特异性和人群普遍性预测模型。