Department of Psychiatry and Behavioral Sciences, Stanford University School of Medicine, Stanford, CA 94305, USA.
Neuroimage. 2010 Jun;51(2):752-64. doi: 10.1016/j.neuroimage.2010.02.040. Epub 2010 Feb 24.
Multivariate pattern recognition methods are increasingly being used to identify multiregional brain activity patterns that collectively discriminate one cognitive condition or experimental group from another, using fMRI data. The performance of these methods is often limited because the number of regions considered in the analysis of fMRI data is large compared to the number of observations (trials or participants). Existing methods that aim to tackle this dimensionality problem are less than optimal because they either over-fit the data or are computationally intractable. Here, we describe a novel method based on logistic regression using a combination of L1 and L2 norm regularization that more accurately estimates discriminative brain regions across multiple conditions or groups. The L1 norm, computed using a fast estimation procedure, ensures a fast, sparse and generalizable solution; the L2 norm ensures that correlated brain regions are included in the resulting solution, a critical aspect of fMRI data analysis often overlooked by existing methods. We first evaluate the performance of our method on simulated data and then examine its effectiveness in discriminating between well-matched music and speech stimuli. We also compared our procedures with other methods which use either L1-norm regularization alone or support vector machine-based feature elimination. On simulated data, our methods performed significantly better than existing methods across a wide range of contrast-to-noise ratios and feature prevalence rates. On experimental fMRI data, our methods were more effective in selectively isolating a distributed fronto-temporal network that distinguished between brain regions known to be involved in speech and music processing. These findings suggest that our method is not only computationally efficient, but it also achieves the twin objectives of identifying relevant discriminative brain regions and accurately classifying fMRI data.
多变量模式识别方法越来越多地被用于识别多区域脑活动模式,这些模式共同区分一种认知状态或实验组与另一种,使用 fMRI 数据。这些方法的性能通常受到限制,因为在 fMRI 数据分析中考虑的区域数量与观察数量(试验或参与者)相比很大。现有的旨在解决这个维度问题的方法并不理想,因为它们要么过度拟合数据,要么计算上难以处理。在这里,我们描述了一种基于逻辑回归的新方法,该方法结合了 L1 和 L2 范数正则化,可以更准确地估计多个条件或组的判别性脑区。使用快速估计过程计算的 L1 范数确保了快速、稀疏和可推广的解决方案;L2 范数确保了相关的脑区被包含在最终的解决方案中,这是 fMRI 数据分析中经常被现有方法忽略的一个关键方面。我们首先在模拟数据上评估我们的方法的性能,然后检查其在区分匹配良好的音乐和语音刺激方面的有效性。我们还将我们的程序与其他方法进行了比较,这些方法要么单独使用 L1 范数正则化,要么使用基于支持向量机的特征消除。在模拟数据上,我们的方法在广泛的对比噪声比和特征出现率范围内的性能明显优于现有方法。在实验 fMRI 数据上,我们的方法更有效地选择性地隔离了一个分布式额颞网络,该网络区分了已知参与语音和音乐处理的脑区。这些发现表明,我们的方法不仅计算效率高,而且还实现了识别相关判别性脑区和准确分类 fMRI 数据的双重目标。