Tao Chenyang, Nichols Thomas E, Hua Xue, Ching Christopher R K, Rolls Edmund T, Thompson Paul M, Feng Jianfeng
Centre for Computational Systems Biology and School of Mathematical Sciences, Fudan University, Shanghai, PR China; Department of Computer Science, Warwick University, Coventry, UK.
Department of Statistics, University of Warwick, Coventry, UK.
Neuroimage. 2017 Jan 1;144(Pt A):35-57. doi: 10.1016/j.neuroimage.2016.08.027. Epub 2016 Sep 22.
We propose a generalized reduced rank latent factor regression model (GRRLF) for the analysis of tensor field responses and high dimensional covariates. The model is motivated by the need from imaging-genetic studies to identify genetic variants that are associated with brain imaging phenotypes, often in the form of high dimensional tensor fields. GRRLF identifies from the structure in the data the effective dimensionality of the data, and then jointly performs dimension reduction of the covariates, dynamic identification of latent factors, and nonparametric estimation of both covariate and latent response fields. After accounting for the latent and covariate effects, GRLLF performs a nonparametric test on the remaining factor of interest. GRRLF provides a better factorization of the signals compared with common solutions, and is less susceptible to overfitting because it exploits the effective dimensionality. The generality and the flexibility of GRRLF also allow various statistical models to be handled in a unified framework and solutions can be efficiently computed. Within the field of neuroimaging, it improves the sensitivity for weak signals and is a promising alternative to existing approaches. The operation of the framework is demonstrated with both synthetic datasets and a real-world neuroimaging example in which the effects of a set of genes on the structure of the brain at the voxel level were measured, and the results compared favorably with those from existing approaches.
我们提出了一种广义降秩潜在因子回归模型(GRRLF),用于分析张量场响应和高维协变量。该模型的动机源于成像遗传学研究的需求,即识别与脑成像表型相关的遗传变异,这些表型通常以高维张量场的形式存在。GRRLF从数据结构中识别数据的有效维度,然后联合执行协变量的降维、潜在因子的动态识别以及协变量和潜在响应场的非参数估计。在考虑潜在和协变量效应后,GRLLF对其余感兴趣的因子进行非参数检验。与常见解决方案相比,GRRLF对信号提供了更好的分解,并且由于利用了有效维度,不易受到过拟合的影响。GRRLF的通用性和灵活性还允许在统一框架中处理各种统计模型,并且可以高效地计算解决方案。在神经成像领域,它提高了对弱信号的敏感性,是现有方法的一个有前途的替代方案。通过合成数据集和一个真实世界的神经成像示例展示了该框架的操作,在该示例中测量了一组基因对体素水平脑结构的影响,结果与现有方法相比具有优势。