IEEE Trans Med Imaging. 2014 Apr;33(4):925-34. doi: 10.1109/TMI.2014.2298856.
Multi-voxel pattern analysis (MVPA) of functional magnetic resonance imaging (fMRI) data is an emerging approach for probing the neural correlates of cognition. MVPA allows cognitive states to be modeled as distributed patterns of neural activity and classified according to stimulus conditions. In practice, building a robust, generalizable classification model can be challenging because the number of voxels (features) far exceeds the number of stimulus instances/data observations. To avoid model overfitting, there is a need to select informative voxels before building a classification model. In this paper, we propose a robust feature (voxel) selection framework using mutual information (MI) and partial least square regression (PLS) to establish an informativeness index for prioritizing selection of voxels based on the degree of their association to the experimental conditions. We evaluated the robustness of our proposed framework by assessing performance of standard classification algorithms, when combined with our feature selection approach, in a publicly-available fMRI dataset of object-level representation widely used to benchmark MVPA performance (Haxby, 2001). The computational results suggest that our feature selection framework based on MI and PLS drastically improves the classification accuracy relative to those previously reported in the literature. Our results also suggest that highly informative voxels may provide meaningful insight into the functional-anatomic relationship of brain activity and stimulus conditions.
多体素模式分析 (MVPA) 是一种新兴的功能磁共振成像 (fMRI) 数据分析方法,可用于探测认知的神经相关性。MVPA 将认知状态建模为神经活动的分布式模式,并根据刺激条件进行分类。在实践中,构建稳健、可推广的分类模型可能具有挑战性,因为体素(特征)的数量远远超过刺激实例/数据观测的数量。为了避免模型过拟合,需要在构建分类模型之前选择信息丰富的体素。在本文中,我们提出了一种使用互信息 (MI) 和偏最小二乘回归 (PLS) 的稳健特征(体素)选择框架,以建立一个信息指标,根据其与实验条件的关联程度对体素进行优先选择。我们通过评估标准分类算法的性能来评估我们提出的框架的稳健性,当与我们的特征选择方法结合使用时,该算法在一个广泛用于基准 MVPA 性能的公开可用的对象级表示 fMRI 数据集(Haxby,2001)上。计算结果表明,我们基于 MI 和 PLS 的特征选择框架与文献中之前报道的方法相比,大大提高了分类准确性。我们的结果还表明,高信息量的体素可能为大脑活动和刺激条件的功能解剖关系提供有意义的见解。