Princeton University, United States.
Princeton University, United States.
Neuroimage. 2018 Oct 15;180(Pt A):223-231. doi: 10.1016/j.neuroimage.2017.06.042. Epub 2017 Jun 23.
Several research groups have shown how to map fMRI responses to the meanings of presented stimuli. This paper presents new methods for doing so when only a natural language annotation is available as the description of the stimulus. We study fMRI data gathered from subjects watching an episode of BBCs Sherlock (Chen et al., 2017), and learn bidirectional mappings between fMRI responses and natural language representations. By leveraging data from multiple subjects watching the same movie, we were able to perform scene classification with 72% accuracy (random guessing would give 4%) and scene ranking with average rank in the top 4% (random guessing would give 50%). The key ingredients underlying this high level of performance are (a) the use of the Shared Response Model (SRM) and its variant SRM-ICA (Chen et al., 2015; Zhang et al., 2016) to aggregate fMRI data from multiple subjects, both of which are shown to be superior to standard PCA in producing low-dimensional representations for the tasks in this paper; (b) a sentence embedding technique adapted from the natural language processing (NLP) literature (Arora et al., 2017) that produces semantic vector representation of the annotations; (c) using previous timestep information in the featurization of the predictor data. These optimizations in how we featurize the fMRI data and text annotations provide a substantial improvement in classification performance, relative to standard approaches.
一些研究小组已经展示了如何将 fMRI 反应映射到呈现刺激的含义上。本文提出了在仅使用自然语言注释作为刺激描述的情况下进行此操作的新方法。我们研究了从观看 BBC 电视剧《神探夏洛克》(Chen 等人,2017)的受试者收集的 fMRI 数据,并学习了 fMRI 反应与自然语言表示之间的双向映射。通过利用多个受试者观看同一部电影的数据,我们能够以 72%的准确率进行场景分类(随机猜测的准确率为 4%),并且平均排名在前 4%(随机猜测的准确率为 50%)。实现这种高水平性能的关键因素是:(a)使用共享响应模型(SRM)及其变体 SRM-ICA(Chen 等人,2015;Zhang 等人,2016)来聚合来自多个受试者的 fMRI 数据,这两种方法都优于标准 PCA,可针对本文任务生成低维表示;(b)一种来自自然语言处理(NLP)文献的句子嵌入技术(Arora 等人,2017),可生成注释的语义向量表示;(c)在预测器数据的特征化中使用前一个时间步信息。这些对我们如何对 fMRI 数据和文本注释进行特征化的优化,与标准方法相比,显著提高了分类性能。