Parlak Fatma, Pham Damon D, Spencer Daniel A, Welsh Robert C, Mejia Amanda F
Department of Statistics, Indiana University, Bloomington, IN, United States.
Department of Psychiatry and Bio-behavioral Sciences, University of California, Los Angeles, Los Angeles, CA, United States.
Front Neurosci. 2023 Jan 4;16:1051424. doi: 10.3389/fnins.2022.1051424. eCollection 2022.
Analysis of task fMRI studies is typically based on using ordinary least squares within a voxel- or vertex-wise linear regression framework known as the general linear model. This use produces estimates and standard errors of the regression coefficients representing amplitudes of task-induced activations. To produce valid statistical inferences, several key statistical assumptions must be met, including that of independent residuals. Since task fMRI residuals often exhibit temporal autocorrelation, it is common practice to perform "prewhitening" to mitigate that dependence. Prewhitening involves estimating the residual correlation structure and then applying a filter to induce residual temporal independence. While theoretically straightforward, a major challenge in prewhitening for fMRI data is accurately estimating the residual autocorrelation at each voxel or vertex of the brain. Assuming a global model for autocorrelation, which is the default in several standard fMRI software tools, may under- or over-whiten in certain areas and produce differential false positive control across the brain. The increasing popularity of multiband acquisitions with faster temporal resolution increases the challenge of effective prewhitening because more complex models are required to accurately capture the strength and structure of autocorrelation. These issues are becoming more critical now because of a trend toward subject-level analysis and inference. In group-average or group-difference analyses, the within-subject residual correlation structure is accounted for implicitly, so inadequate prewhitening is of little real consequence. For individual subject inference, however, accurate prewhitening is crucial to avoid inflated or spatially variable false positive rates.
In this paper, we first thoroughly examine the patterns, sources and strength of residual autocorrelation in multiband task fMRI data. Second, we evaluate the ability of different autoregressive (AR) model-based prewhitening strategies to effectively mitigate autocorrelation and control false positives. We consider two main factors: the choice of AR model order and the level of spatial regularization of AR model coefficients, ranging from local smoothing to global averaging. We also consider determining the AR model order optimally at every vertex, but we do not observe an additional benefit of this over the use of higher-order AR models (e.g. (AR(6)). To overcome the computational challenge associated with spatially variable prewhitening, we developed a computationally efficient R implementation using parallelization and fast C++ backend code. This implementation is included in the open source R package BayesfMRI.
We find that residual autocorrelation exhibits marked spatial variance across the cortex and is influenced by many factors including the task being performed, the specific acquisition protocol, mis-modeling of the hemodynamic response function, unmodeled noise due to subject head motion, and systematic individual differences. We also find that local regularization is much more effective than global averaging at mitigating autocorrelation. While increasing the AR model order is also helpful, it has a lesser effect than allowing AR coefficients to vary spatially. We find that prewhitening with an AR(6) model with local regularization is effective at reducing or even eliminating autocorrelation and controlling false positives.
Our analysis revealed dramatic spatial differences in autocorrelation across the cortex. This spatial topology is unique to each session, being influenced by the task being performed, the acquisition technique, various modeling choices, and individual differences. If not accounted for, these differences will result in differential false positive control and power across the cortex and across subjects.
任务功能磁共振成像(fMRI)研究的分析通常基于在体素或顶点级线性回归框架内使用普通最小二乘法,该框架称为一般线性模型。这种方法可得出代表任务诱发激活幅度的回归系数的估计值和标准误差。为了进行有效的统计推断,必须满足几个关键的统计假设,包括独立残差的假设。由于任务fMRI残差通常表现出时间自相关性,因此通常会进行“预白化”以减轻这种依赖性。预白化包括估计残差相关结构,然后应用滤波器以诱导残差时间独立性。虽然从理论上讲很简单,但fMRI数据预白化的一个主要挑战是准确估计大脑每个体素或顶点处的残差自相关性。假设自相关性的全局模型(这是几种标准fMRI软件工具的默认设置)可能会在某些区域过度或不足白化,并在整个大脑中产生不同的假阳性控制。随着具有更快时间分辨率的多频段采集越来越流行,有效的预白化挑战也在增加,因为需要更复杂的模型来准确捕捉自相关性的强度和结构。由于目前有向个体水平分析和推断发展的趋势,这些问题变得更加关键。在组平均或组差异分析中,个体内部的残差相关结构被隐含地考虑在内,因此预白化不足几乎没有实际影响。然而,对于个体受试者推断,准确的预白化对于避免过高或空间可变的假阳性率至关重要。
在本文中,我们首先深入研究多频段任务fMRI数据中残差自相关性的模式、来源和强度。其次,我们评估基于不同自回归(AR)模型的预白化策略有效减轻自相关性和控制假阳性的能力。我们考虑两个主要因素:AR模型阶数的选择以及AR模型系数的空间正则化水平,范围从局部平滑到全局平均。我们还考虑在每个顶点处最优地确定AR模型阶数,但我们没有观察到这样做比使用高阶AR模型(例如AR(6))有额外的好处。为了克服与空间可变预白化相关的计算挑战,我们使用并行化和快速C++后端代码开发了一种计算效率高的R实现。此实现包含在开源R包BayesfMRI中。
我们发现残差自相关性在整个皮层中表现出明显的空间差异,并且受到许多因素的影响,包括正在执行的任务、特定的采集协议、血流动力学响应函数的错误建模、由于受试者头部运动导致的未建模噪声以及个体差异。我们还发现局部正则化在减轻自相关性方面比全局平均更有效。虽然增加AR模型阶数也有帮助,但它的效果不如允许AR系数在空间上变化。我们发现使用具有局部正则化的AR(6)模型进行预白化在减少甚至消除自相关性和控制假阳性方面是有效的。
我们的分析揭示了整个皮层中自相关性的巨大空间差异。这种空间拓扑结构对于每个实验都是独特的,受到正在执行的任务、采集技术、各种建模选择和个体差异的影响。如果不加以考虑,这些差异将导致整个皮层和不同受试者之间不同的假阳性控制和检验效能。