Department of Biostatistics, Epidemiology, and Informatics, Penn Statistics in Imaging and Visualization Center, University of Pennsylvania, Philadelphia, Pennsylvania.
Department of Radiology, Center for Biomedical Image Computing and Analytics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania.
Biometrics. 2020 Mar;76(1):257-269. doi: 10.1111/biom.13123. Epub 2019 Sep 30.
The field of neuroimaging dedicated to mapping connections in the brain is increasingly being recognized as key for understanding neurodevelopment and pathology. Networks of these connections are quantitatively represented using complex structures, including matrices, functions, and graphs, which require specialized statistical techniques for estimation and inference about developmental and disorder-related changes. Unfortunately, classical statistical testing procedures are not well suited to high-dimensional testing problems. In the context of global or regional tests for differences in neuroimaging data, traditional analysis of variance (ANOVA) is not directly applicable without first summarizing the data into univariate or low-dimensional features, a process that might mask the salient features of high-dimensional distributions. In this work, we consider a general framework for two-sample testing of complex structures by studying generalized within-group and between-group variances based on distances between complex and potentially high-dimensional observations. We derive an asymptotic approximation to the null distribution of the ANOVA test statistic, and conduct simulation studies with scalar and graph outcomes to study finite sample properties of the test. Finally, we apply our test to our motivating study of structural connectivity in autism spectrum disorder.
致力于绘制大脑连接的神经影像学领域越来越被认为是理解神经发育和病理学的关键。这些连接的网络使用复杂的结构进行定量表示,包括矩阵、函数和图形,这些结构需要专门的统计技术来估计和推断与发育和障碍相关的变化。不幸的是,经典的统计检验程序并不适合高维检验问题。在神经影像学数据差异的全局或区域检验的背景下,如果不首先将数据汇总为单变量或低维特征,传统的方差分析 (ANOVA) 就无法直接应用,这个过程可能会掩盖高维分布的显著特征。在这项工作中,我们通过研究基于复杂和潜在高维观测值之间距离的广义组内和组间方差,考虑了一种用于复杂结构两样本检验的一般框架。我们推导出了 ANOVA 检验统计量的渐近近似零分布,并使用标量和图形结果进行了模拟研究,以研究该检验的有限样本性质。最后,我们将我们的检验应用于我们对自闭症谱系障碍结构连接的动机研究。