• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

多模态数据分析的综合因子回归及其推断

Integrative Factor Regression and Its Inference for Multimodal Data Analysis.

作者信息

Li Quefeng, Li Lexin

机构信息

University of North Carolina, Chapel Hill and University of California, Berkeley.

出版信息

J Am Stat Assoc. 2022;117(540):2207-2221. doi: 10.1080/01621459.2021.1914635. Epub 2021 May 20.

DOI:10.1080/01621459.2021.1914635
PMID:36793370
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9928172/
Abstract

Multimodal data, where different types of data are collected from the same subjects, are fast emerging in a large variety of scientific applications. Factor analysis is commonly used in integrative analysis of multimodal data, and is particularly useful to overcome the curse of high dimensionality and high correlations. However, there is little work on statistical inference for factor analysis based supervised modeling of multimodal data. In this article, we consider an integrative linear regression model that is built upon the latent factors extracted from multimodal data. We address three important questions: how to infer the significance of one data modality given the other modalities in the model; how to infer the significance of a combination of variables from one modality or across different modalities; and how to quantify the contribution, measured by the goodness-of-fit, of one data modality given the others. When answering each question, we explicitly characterize both the benefit and the extra cost of factor analysis. Those questions, to our knowledge, have not yet been addressed despite wide use of factor analysis in integrative multimodal analysis, and our proposal bridges an important gap. We study the empirical performance of our methods through simulations, and further illustrate with a multimodal neuroimaging analysis.

摘要

多模态数据是指从同一受试者收集的不同类型的数据,在各种各样的科学应用中迅速兴起。因子分析常用于多模态数据的综合分析,尤其有助于克服高维度和高相关性的问题。然而,基于多模态数据的因子分析监督建模的统计推断方面的工作却很少。在本文中,我们考虑一个基于从多模态数据中提取的潜在因子构建的综合线性回归模型。我们解决三个重要问题:在模型中给定其他模态的情况下,如何推断一种数据模态的显著性;如何推断来自一种模态或跨不同模态的变量组合的显著性;以及如何在给定其他数据模态的情况下,量化一种数据模态以拟合优度衡量的贡献。在回答每个问题时,我们明确刻画了因子分析的益处和额外成本。据我们所知,尽管因子分析在综合多模态分析中被广泛使用,但这些问题尚未得到解决,我们的提议填补了一个重要空白。我们通过模拟研究了我们方法的实证性能,并通过多模态神经影像分析进一步进行了说明。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3aeb/9928172/02d4d0b31c8a/nihms-1769026-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3aeb/9928172/ed02b3fc95ac/nihms-1769026-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3aeb/9928172/67aa41163ceb/nihms-1769026-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3aeb/9928172/02d4d0b31c8a/nihms-1769026-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3aeb/9928172/ed02b3fc95ac/nihms-1769026-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3aeb/9928172/67aa41163ceb/nihms-1769026-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3aeb/9928172/02d4d0b31c8a/nihms-1769026-f0003.jpg

相似文献

1
Integrative Factor Regression and Its Inference for Multimodal Data Analysis.多模态数据分析的综合因子回归及其推断
J Am Stat Assoc. 2022;117(540):2207-2221. doi: 10.1080/01621459.2021.1914635. Epub 2021 May 20.
2
Simultaneous Covariance Inference for Multimodal Integrative Analysis.用于多模态整合分析的同步协方差推断
J Am Stat Assoc. 2020;115(531):1279-1291. doi: 10.1080/01621459.2019.1623040. Epub 2019 Jun 28.
3
Multimodal neuroimaging data integration and pathway analysis.多模态神经影像学数据整合与通路分析。
Biometrics. 2021 Sep;77(3):879-889. doi: 10.1111/biom.13351. Epub 2020 Aug 20.
4
Generalized Liquid Association Analysis for Multimodal Data Integration.用于多模态数据集成的广义液体关联分析
J Am Stat Assoc. 2023;118(543):1984-1996. doi: 10.1080/01621459.2021.2024437. Epub 2022 Mar 31.
5
Orthogonalized Kernel Debiased Machine Learning for Multimodal Data Analysis.用于多模态数据分析的正交化核去偏机器学习
J Am Stat Assoc. 2023;118(543):1796-1810. doi: 10.1080/01621459.2021.2013851. Epub 2022 Feb 3.
6
Assessment of inference-making in children using comprehension questions and story retelling: Effect of text modality and a story presentation format.使用理解性问题和故事复述对儿童推理能力的评估:文本模态和故事呈现形式的影响。
Int J Lang Commun Disord. 2021 May;56(3):637-652. doi: 10.1111/1460-6984.12620.
7
Integrative Data Analysis of Multi-Platform Cancer Data with a Multimodal Deep Learning Approach.基于多模态深度学习方法的多平台癌症数据综合数据分析
IEEE/ACM Trans Comput Biol Bioinform. 2015 Jul-Aug;12(4):928-37. doi: 10.1109/TCBB.2014.2377729.
8
Statistical Inferences for Complex Dependence of Multimodal Imaging Data.多模态成像数据复杂依赖性的统计推断
J Am Stat Assoc. 2024;119(546):1486-1499. doi: 10.1080/01621459.2023.2200610. Epub 2023 May 26.
9
Sequential Pathway Inference for Multimodal Neuroimaging Analysis.用于多模态神经影像分析的序列通路推断
Stat. 2022 Dec;11(1). doi: 10.1002/sta4.433. Epub 2021 Oct 15.
10
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

引用本文的文献

1
Fault diagnosis in electric motors using multi-mode time series and ensemble transformers network.基于多模式时间序列和集成变压器网络的电动机故障诊断
Sci Rep. 2025 Mar 6;15(1):7834. doi: 10.1038/s41598-025-89695-6.
2
Joint and Individual Component Regression.联合与个体成分回归
J Comput Graph Stat. 2024;33(3):763-773. doi: 10.1080/10618600.2023.2284227. Epub 2023 Dec 29.
3
Are Latent Factor Regression and Sparse Regression Adequate?潜在因子回归和稀疏回归是否足够?

本文引用的文献

1
D-CCA: A Decomposition-based Canonical Correlation Analysis for High-Dimensional Datasets.D-CCA:一种用于高维数据集的基于分解的典型相关分析
J Am Stat Assoc. 2020;115(529):292-306. doi: 10.1080/01621459.2018.1543599. Epub 2019 Apr 11.
2
Factor-Adjusted Regularized Model Selection.因子调整正则化模型选择
J Econom. 2020 May;216(1):71-85. doi: 10.1016/j.jeconom.2020.01.006. Epub 2020 Feb 7.
3
LINEAR HYPOTHESIS TESTING FOR HIGH DIMENSIONAL GENERALIZED LINEAR MODELS.高维广义线性模型的线性假设检验
J Am Stat Assoc. 2024;119(546):1076-1088. doi: 10.1080/01621459.2023.2169700. Epub 2023 Feb 14.
4
Orthogonalized Kernel Debiased Machine Learning for Multimodal Data Analysis.用于多模态数据分析的正交化核去偏机器学习
J Am Stat Assoc. 2023;118(543):1796-1810. doi: 10.1080/01621459.2021.2013851. Epub 2022 Feb 3.
5
Uncovering Diverse Mechanistic Spreading Pathways in Disease Progression of Alzheimer's Disease.揭示阿尔茨海默病疾病进展中的多种机制传播途径
J Alzheimers Dis Rep. 2023 Aug 11;7(1):855-872. doi: 10.3233/ADR-230081. eCollection 2023.
6
Aligned deep neural network for integrative analysis with high-dimensional input.用于高维输入的综合分析的对齐深度神经网络。
J Biomed Inform. 2023 Aug;144:104434. doi: 10.1016/j.jbi.2023.104434. Epub 2023 Jun 28.
Ann Stat. 2019 Oct;47(5):2671-2703. doi: 10.1214/18-AOS1761. Epub 2019 Aug 3.
4
Structural learning and integrative decomposition of multi-view data.多视图数据的结构学习与整合分解
Biometrics. 2019 Dec;75(4):1121-1132. doi: 10.1111/biom.13108. Epub 2019 Sep 15.
5
Integrative multi-view regression: Bridging group-sparse and low-rank models.整合多视角回归:连接组稀疏和低秩模型
Biometrics. 2019 Jun;75(2):593-602. doi: 10.1111/biom.13006. Epub 2019 Mar 29.
6
Embracing the Blessing of Dimensionality in Factor Models.拥抱因子模型中维度的福祉。
J Am Stat Assoc. 2018;113(521):380-389. doi: 10.1080/01621459.2016.1256815. Epub 2017 Nov 13.
7
Supervised multiway factorization.监督式多路分解
Electron J Stat. 2018;12(1):1150-1180. doi: 10.1214/18-EJS1421. Epub 2018 Mar 27.
8
Incorporating covariates into integrated factor analysis of multi-view data.将协变量纳入多视图数据的综合因子分析
Biometrics. 2017 Dec;73(4):1433-1442. doi: 10.1111/biom.12698. Epub 2017 Apr 13.
9
A review on machine learning principles for multi-view biological data integration.机器学习原理在多视图生物数据集成中的研究综述。
Brief Bioinform. 2018 Mar 1;19(2):325-340. doi: 10.1093/bib/bbw113.
10
Statistical Methods in Integrative Genomics.整合基因组学中的统计方法
Annu Rev Stat Appl. 2016 Jun;3:181-209. doi: 10.1146/annurev-statistics-041715-033506. Epub 2016 Apr 18.