Li Lexin, Zeng Jing, Zhang Xin
University of California at Berkeley.
Florida State University.
J Am Stat Assoc. 2023;118(543):1984-1996. doi: 10.1080/01621459.2021.2024437. Epub 2022 Mar 31.
Multimodal data are now prevailing in scientific research. One of the central questions in multimodal integrative analysis is to understand how two data modalities associate and interact with each other given another modality or demographic variables. The problem can be formulated as studying the associations among three sets of random variables, a question that has received relatively less attention in the literature. In this article, we propose a novel generalized liquid association analysis method, which offers a new and unique angle to this important class of problems of studying three-way associations. We extend the notion of liquid association of Li (2002) from the univariate setting to the sparse, multivariate, and high-dimensional setting. We establish a population dimension reduction model, transform the problem to sparse Tucker decomposition of a three-way tensor, and develop a higher-order orthogonal iteration algorithm for parameter estimation. We derive the non-asymptotic error bound and asymptotic consistency of the proposed estimator, while allowing the variable dimensions to be larger than and diverge with the sample size. We demonstrate the efficacy of the method through both simulations and a multimodal neuroimaging application for Alzheimer's disease research.
多模态数据如今在科学研究中很普遍。多模态综合分析的核心问题之一是,在给定另一种模态或人口统计学变量的情况下,理解两种数据模态如何相互关联和相互作用。这个问题可以表述为研究三组随机变量之间的关联,这一问题在文献中受到的关注相对较少。在本文中,我们提出了一种新颖的广义液体关联分析方法,该方法为研究三元关联这一重要类别的问题提供了一个全新且独特的视角。我们将Li(2002)的液体关联概念从单变量设置扩展到稀疏、多变量和高维设置。我们建立了一个总体降维模型,将问题转化为一个三阶张量的稀疏塔克分解,并开发了一种用于参数估计的高阶正交迭代算法。我们推导了所提出估计器的非渐近误差界和渐近一致性,同时允许变量维度大于样本量并随样本量发散。我们通过模拟和用于阿尔茨海默病研究的多模态神经成像应用证明了该方法的有效性。