Mejia Amanda F, Nebel Mary Beth, Wang Yikai, Caffo Brian S, Guo Ying
Department of Statistics, Indiana University, Bloomington, IN 47408.
Center for Neurodevelopmental and Imaging Research, Kennedy Krieger Institute, Baltimore, MD 21205.
J Am Stat Assoc. 2020;115(531):1151-1177. doi: 10.1080/01621459.2019.1679638. Epub 2019 Nov 21.
Large brain imaging databases contain a wealth of information on brain organization in the populations they target, and on individual variability. While such databases have been used to study group-level features of populations directly, they are currently underutilized as a resource to inform single-subject analysis. Here, we propose leveraging the information contained in large functional magnetic resonance imaging (fMRI) databases by establishing population priors to employ in an empirical Bayesian framework. We focus on estimation of brain networks as source signals in independent component analysis (ICA). We formulate a hierarchical "template" ICA model where source signals-including known population brain networks and subject-specific signals-are represented as latent variables. For estimation, we derive an expectation maximization (EM) algorithm having an explicit solution. However, as this solution is computationally intractable, we also consider an approximate subspace algorithm and a faster two-stage approach. Through extensive simulation studies, we assess performance of both methods and compare with dual regression, a popular but ad-hoc method. The two proposed algorithms have similar performance, and both dramatically outperform dual regression. We also conduct a reliability study utilizing the Human Connectome Project and find that template ICA achieves substantially better performance than dual regression, achieving 75-250% higher intra-subject reliability.
大型脑成像数据库包含了大量关于其目标人群脑组织结构以及个体变异性的信息。虽然此类数据库已被用于直接研究人群的群体水平特征,但目前作为一种为单受试者分析提供信息的资源,它们并未得到充分利用。在此,我们建议通过建立群体先验信息来利用大型功能磁共振成像(fMRI)数据库中包含的信息,并将其应用于经验贝叶斯框架。我们专注于在独立成分分析(ICA)中估计作为源信号的脑网络。我们构建了一个分层的“模板”ICA模型,其中源信号(包括已知的群体脑网络和特定受试者的信号)被表示为潜在变量。为了进行估计,我们推导了一种具有显式解的期望最大化(EM)算法。然而,由于该解在计算上难以处理,我们还考虑了一种近似子空间算法和一种更快的两阶段方法。通过广泛的模拟研究,我们评估了这两种方法的性能,并与一种流行但临时的方法——双重回归进行了比较。所提出的两种算法具有相似的性能,并且都显著优于双重回归。我们还利用人类连接体项目进行了一项可靠性研究,发现模板ICA的性能比双重回归有显著提升,受试者内可靠性提高了75 - 250%。