Wang Bingkai, Luo Xi, Zhao Yi, Caffo Brian
Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland.
The University of Texas, Health Science Center at Houston School of Public Health, Houston, Texas.
Biometrics. 2021 Dec;77(4):1175-1186. doi: 10.1111/biom.13369. Epub 2020 Oct 10.
We consider the problem of jointly modeling multiple covariance matrices by partial common principal component analysis (PCPCA), which assumes a proportion of eigenvectors to be shared across covariance matrices and the rest to be individual-specific. This paper proposes consistent estimators of the shared eigenvectors in the PCPCA as the number of matrices or the number of samples to estimate each matrix goes to infinity. We prove such asymptotic results without making any assumptions on the ranks of eigenvalues that are associated with the shared eigenvectors. When the number of samples goes to infinity, our results do not require the data to be Gaussian distributed. Furthermore, this paper introduces a sequential testing procedure to identify the number of shared eigenvectors in the PCPCA. In simulation studies, our method shows higher accuracy in estimating the shared eigenvectors than competing methods. Applied to a motor-task functional magnetic resonance imaging data set, our estimator identifies meaningful brain networks that are consistent with current scientific understandings of motor networks during a motor paradigm.
我们考虑通过部分共同主成分分析(PCPCA)对多个协方差矩阵进行联合建模的问题,该方法假定一部分特征向量在协方差矩阵之间共享,其余的则是各个矩阵特有的。本文提出了PCPCA中共享特征向量的一致估计量,当矩阵数量或估计每个矩阵的样本数量趋于无穷大时,这些估计量成立。我们在不对与共享特征向量相关的特征值的秩做任何假设的情况下证明了此类渐近结果。当样本数量趋于无穷大时,我们的结果不要求数据服从高斯分布。此外,本文引入了一种序贯检验程序来确定PCPCA中共享特征向量的数量。在模拟研究中,我们的方法在估计共享特征向量方面比其他竞争方法具有更高的准确性。应用于一个运动任务功能磁共振成像数据集时,我们的估计量识别出了有意义的脑网络,这些网络与当前对运动范式期间运动网络的科学理解一致。