Generalized Liquid Association Analysis for Multimodal Data Integration.

Authors

Li Lexin, Zeng Jing, Zhang Xin

Affiliations

University of California at Berkeley.

Florida State University.

Publication information

J Am Stat Assoc. 2023;118(543):1984-1996. doi: 10.1080/01621459.2021.2024437. Epub 2022 Mar 31.

DOI:10.1080/01621459.2021.2024437

PMID:38099062

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10720690/

Abstract

Multimodal data are now prevailing in scientific research. One of the central questions in multimodal integrative analysis is to understand how two data modalities associate and interact with each other given another modality or demographic variables. The problem can be formulated as studying the associations among three sets of random variables, a question that has received relatively less attention in the literature. In this article, we propose a novel generalized liquid association analysis method, which offers a new and unique angle to this important class of problems of studying three-way associations. We extend the notion of liquid association of Li (2002) from the univariate setting to the sparse, multivariate, and high-dimensional setting. We establish a population dimension reduction model, transform the problem to sparse Tucker decomposition of a three-way tensor, and develop a higher-order orthogonal iteration algorithm for parameter estimation. We derive the non-asymptotic error bound and asymptotic consistency of the proposed estimator, while allowing the variable dimensions to be larger than and diverge with the sample size. We demonstrate the efficacy of the method through both simulations and a multimodal neuroimaging application for Alzheimer's disease research.
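As an illustration of the estimation step named in the abstract, the following is a minimal sketch of a plain higher-order orthogonal iteration (HOOI) for a Tucker decomposition of a dense three-way array. It is not the authors' implementation: the sparsity-inducing step of the proposed method and the construction of the three-way tensor from the data are not described in the abstract and are omitted here. The function names (`unfold`, `mode_product`, `hooi`, `reconstruct`) and the fixed iteration count are assumptions made for this example.

```python
import numpy as np


def unfold(T, mode):
    """Mode-`mode` matricization: rows index the chosen mode."""
    return np.moveaxis(T, mode, 0).reshape(T.shape[mode], -1)


def mode_product(T, M, mode):
    """Multiply tensor T by matrix M along `mode`; M has shape (k, T.shape[mode])."""
    out = np.tensordot(M, T, axes=(1, mode))  # contracted axis is replaced; new axis comes first
    return np.moveaxis(out, 0, mode)


def leading_left_singular_vectors(A, r):
    """Orthonormal basis for the top-r left singular subspace of A."""
    U, _, _ = np.linalg.svd(A, full_matrices=False)
    return U[:, :r]


def hooi(T, ranks, n_iter=50):
    """Plain (non-sparse) HOOI. Returns the core tensor and one factor matrix per mode."""
    # Initialize each factor from the unfolding of T itself (HOSVD initialization).
    factors = [leading_left_singular_vectors(unfold(T, m), r)
               for m, r in enumerate(ranks)]
    for _ in range(n_iter):
        for m in range(T.ndim):
            # Project T onto the current subspaces of all other modes ...
            P = T
            for k in range(T.ndim):
                if k != m:
                    P = mode_product(P, factors[k].T, k)
            # ... then refresh the mode-m factor from the projected unfolding.
            factors[m] = leading_left_singular_vectors(unfold(P, m), ranks[m])
    # Core tensor: T projected onto all factor subspaces.
    core = T
    for k in range(T.ndim):
        core = mode_product(core, factors[k].T, k)
    return core, factors


def reconstruct(core, factors):
    """Rebuild the tensor from its Tucker core and factor matrices."""
    T = core
    for k, U in enumerate(factors):
        T = mode_product(T, U, k)
    return T
```

Calling `hooi(T, (r1, r2, r3))` on a `p1 x p2 x p3` array returns an `r1 x r2 x r3` core and three factor matrices with orthonormal columns; `reconstruct(core, factors)` gives the corresponding low-rank approximation of `T`.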

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61ec/10720690/1deb863a5a9e/nihms-1776217-f0001.jpg

Similar articles

Generalized Liquid Association Analysis for Multimodal Data Integration.

J Am Stat Assoc. 2023;118(543):1984-1996. doi: 10.1080/01621459.2021.2024437. Epub 2022 Mar 31.

Orthogonalized Kernel Debiased Machine Learning for Multimodal Data Analysis.

J Am Stat Assoc. 2023;118(543):1796-1810. doi: 10.1080/01621459.2021.2013851. Epub 2022 Feb 3.

Multimodal neuroimaging data integration and pathway analysis.

Biometrics. 2021 Sep;77(3):879-889. doi: 10.1111/biom.13351. Epub 2020 Aug 20.

Sequential Pathway Inference for Multimodal Neuroimaging Analysis.

Stat. 2022 Dec;11(1). doi: 10.1002/sta4.433. Epub 2021 Oct 15.

Multivariate Temporal Point Process Regression.

J Am Stat Assoc. 2023;118(542):830-845. doi: 10.1080/01621459.2021.1955690. Epub 2021 Sep 1.

Tucker Tensor Regression and Neuroimaging Analysis.

Stat Biosci. 2018 Dec;10(3):520-545. doi: 10.1007/s12561-018-9215-6. Epub 2018 Mar 7.

Integrative Factor Regression and Its Inference for Multimodal Data Analysis.

J Am Stat Assoc. 2022;117(540):2207-2221. doi: 10.1080/01621459.2021.1914635. Epub 2021 May 20.

Generalized Connectivity Matrix Response Regression with Applications in Brain Connectivity Studies.

J Comput Graph Stat. 2023;32(1):252-262. doi: 10.1080/10618600.2022.2074434. Epub 2022 Jun 2.

Sufficient dimension reduction with simultaneous estimation of effective dimensions for time-to-event data.

Stat Sin. 2020 Jul;30(3):1285-1311. doi: 10.5705/ss.202017.0550.

Optimal Sparse Singular Value Decomposition for High-Dimensional High-Order Data.

J Am Stat Assoc. 2019;114(528):1708-1725. doi: 10.1080/01621459.2018.1527227. Epub 2019 Mar 20.

References cited in this article

Optimal Sparse Singular Value Decomposition for High-Dimensional High-Order Data.

J Am Stat Assoc. 2019;114(528):1708-1725. doi: 10.1080/01621459.2018.1527227. Epub 2019 Mar 20.

Simultaneous Covariance Inference for Multimodal Integrative Analysis.

J Am Stat Assoc. 2020;115(531):1279-1291. doi: 10.1080/01621459.2019.1623040. Epub 2019 Jun 28.

Sparse and Low-rank Tensor Estimation via Cubic Sketchings.

IEEE Trans Inf Theory. 2020 Sep;66(9):5927-5964. doi: 10.1109/tit.2020.2982499. Epub 2020 Mar 23.

D-CCA: A Decomposition-based Canonical Correlation Analysis for High-Dimensional Datasets.

J Am Stat Assoc. 2020;115(529):292-306. doi: 10.1080/01621459.2018.1543599. Epub 2019 Apr 11.

Spread of pathological tau proteins through communicating neurons in human Alzheimer's disease.

Nat Commun. 2020 May 26;11(1):2612. doi: 10.1038/s41467-020-15701-2.

An iterative penalized least squares approach to sparse canonical correlation analysis.

Biometrics. 2019 Sep;75(3):734-744. doi: 10.1111/biom.13043. Epub 2019 Apr 9.

A new dynamic correlation algorithm reveals novel functional aspects in single cell and bulk RNA-seq data.

PLoS Comput Biol. 2018 Aug 6;14(8):e1006391. doi: 10.1371/journal.pcbi.1006391. eCollection 2018 Aug.

Exploring patterns enriched in a dataset with contrastive principal component analysis.

Nat Commun. 2018 May 30;9(1):2134. doi: 10.1038/s41467-018-04608-8.

Amyloid-β plaques enhance Alzheimer's brain tau-seeded pathologies by facilitating neuritic plaque tau aggregation.

Nat Med. 2018 Jan;24(1):29-38. doi: 10.1038/nm.4443. Epub 2017 Dec 4.

Incorporating covariates into integrated factor analysis of multi-view data.

Biometrics. 2017 Dec;73(4):1433-1442. doi: 10.1111/biom.12698. Epub 2017 Apr 13.
