• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

二维关联矩阵的综合分解。

Integrative factorization of bidimensionally linked matrices.

机构信息

Division of Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota.

出版信息

Biometrics. 2020 Mar;76(1):61-74. doi: 10.1111/biom.13141. Epub 2019 Nov 10.

DOI:10.1111/biom.13141
PMID:31444786
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7036334/
Abstract

Advances in molecular "omics" technologies have motivated new methodologies for the integration of multiple sources of high-content biomedical data. However, most statistical methods for integrating multiple data matrices only consider data shared vertically (one cohort on multiple platforms) or horizontally (different cohorts on a single platform). This is limiting for data that take the form of bidimensionally linked matrices (eg, multiple cohorts measured on multiple platforms), which are increasingly common in large-scale biomedical studies. In this paper, we propose bidimensional integrative factorization (BIDIFAC) for integrative dimension reduction and signal approximation of bidimensionally linked data matrices. Our method factorizes data into (a) globally shared, (b) row-shared, (c) column-shared, and (d) single-matrix structural components, facilitating the investigation of shared and unique patterns of variability. For estimation, we use a penalized objective function that extends the nuclear norm penalization for a single matrix. As an alternative to the complicated rank selection problem, we use results from the random matrix theory to choose tuning parameters. We apply our method to integrate two genomics platforms (messenger RNA and microRNA expression) across two sample cohorts (tumor samples and normal tissue samples) using the breast cancer data from the Cancer Genome Atlas. We provide R code for fitting BIDIFAC, imputing missing values, and generating simulated data.

摘要

分子“组学”技术的进步推动了整合多源高内涵生物医学数据的新方法的发展。然而,大多数整合多个数据矩阵的统计方法仅考虑垂直方向(一个队列在多个平台上)或水平方向(单个平台上的不同队列)共享的数据。对于采用二维链接矩阵形式的数据(例如,在多个平台上测量的多个队列),这是有限的,这种数据在大型生物医学研究中越来越常见。在本文中,我们提出了二维综合因子分析(BIDIFAC),用于二维链接数据矩阵的综合降维和信号逼近。我们的方法将数据分解为(a)全局共享、(b)行共享、(c)列共享和(d)单个矩阵结构组件,便于研究共享和独特的变异性模式。对于估计,我们使用扩展了单个矩阵的核范数惩罚的惩罚目标函数。作为复杂的秩选择问题的替代方案,我们使用随机矩阵理论的结果来选择调整参数。我们使用来自癌症基因组图谱的乳腺癌数据,将两个基因组学平台(信使 RNA 和 microRNA 表达)整合到两个样本队列(肿瘤样本和正常组织样本)中,并应用我们的方法。我们提供了用于拟合 BIDIFAC、插补缺失值和生成模拟数据的 R 代码。

相似文献

1
Integrative factorization of bidimensionally linked matrices.二维关联矩阵的综合分解。
Biometrics. 2020 Mar;76(1):61-74. doi: 10.1111/biom.13141. Epub 2019 Nov 10.
2
BIDIMENSIONAL LINKED MATRIX FACTORIZATION FOR PAN-OMICS PAN-CANCER ANALYSIS.用于泛组学全癌分析的二维链接矩阵分解
Ann Appl Stat. 2022 Mar;16(1):193-215. doi: 10.1214/21-AOAS1495. Epub 2022 Mar 28.
3
A hierarchical spike-and-slab model for pan-cancer survival using pan-omic data.基于泛基因组数据的泛癌生存的层次尖峰-哑块模型。
BMC Bioinformatics. 2022 Jun 17;23(1):235. doi: 10.1186/s12859-022-04770-3.
4
Linked matrix factorization.链接矩阵分解
Biometrics. 2019 Jun;75(2):582-592. doi: 10.1111/biom.13010. Epub 2019 Apr 2.
5
Integrative, multi-omics, analysis of blood samples improves model predictions: applications to cancer.整合多组学分析血液样本可改善模型预测:在癌症中的应用。
BMC Bioinformatics. 2021 Aug 5;22(1):395. doi: 10.1186/s12859-021-04296-0.
6
A penalized matrix decomposition, with applications to sparse principal components and canonical correlation analysis.一种惩罚矩阵分解及其在稀疏主成分分析和典型相关分析中的应用。
Biostatistics. 2009 Jul;10(3):515-34. doi: 10.1093/biostatistics/kxp008. Epub 2009 Apr 17.
7
DeepMF: deciphering the latent patterns in omics profiles with a deep learning method.DeepMF:一种利用深度学习方法解析组学图谱中潜在模式的方法。
BMC Bioinformatics. 2019 Dec 27;20(Suppl 23):648. doi: 10.1186/s12859-019-3291-6.
8
PIntMF: Penalized Integrative Matrix Factorization method for multi-omics data.PIntMF:用于多组学数据的惩罚性整合矩阵分解方法
Bioinformatics. 2022 Jan 27;38(4):900-907. doi: 10.1093/bioinformatics/btab786.
9
Multiple augmented reduced rank regression for pan-cancer analysis.多组增强降秩回归分析泛癌数据。
Biometrics. 2024 Jan 29;80(1). doi: 10.1093/biomtc/ujad002.
10
A non-negative matrix factorization method for detecting modules in heterogeneous omics multi-modal data.一种用于在异质组学多模态数据中检测模块的非负矩阵分解方法。
Bioinformatics. 2016 Jan 1;32(1):1-8. doi: 10.1093/bioinformatics/btv544. Epub 2015 Sep 15.

引用本文的文献

1
Leveraging multimodal neuroimaging and GWAS for identifying modality-level causal pathways to Alzheimer's disease.利用多模态神经影像学和全基因组关联研究来识别阿尔茨海默病的模态水平因果通路。
Imaging Neurosci (Camb). 2025 May 16;3. doi: 10.1162/imag_a_00580. eCollection 2025.
2
Leveraging multimodal neuroimaging and GWAS for identifying modality-level causal pathways to Alzheimer's disease.利用多模态神经影像学和全基因组关联研究来识别阿尔茨海默病的模态水平因果路径。
medRxiv. 2025 Mar 3:2025.02.27.25322897. doi: 10.1101/2025.02.27.25322897.
3
Empirical Bayes Linked Matrix Decomposition.经验贝叶斯链接矩阵分解
Mach Learn. 2024 Oct;113(10):7451-7477. doi: 10.1007/s10994-024-06599-8. Epub 2024 Aug 7.
4
Bootstrap Evaluation of Association Matrices (BEAM) for Integrating Multiple Omics Profiles with Multiple Outcomes.用于整合具有多个结果的多个组学概况的关联矩阵的自助评估(BEAM)
bioRxiv. 2024 Aug 3:2024.07.31.605805. doi: 10.1101/2024.07.31.605805.
5
Bayesian Simultaneous Factorization and Prediction Using Multi-Omic Data.使用多组学数据的贝叶斯同时分解与预测
Comput Stat Data Anal. 2024 Sep;197. doi: 10.1016/j.csda.2024.107974. Epub 2024 Apr 30.
6
HIGH-DIMENSIONAL FACTOR REGRESSION FOR HETEROGENEOUS SUBPOPULATIONS.针对异质子群体的高维因子回归
Stat Sin. 2023 Jan;33(1):27-53. doi: 10.5705/ss.202020.0145.
7
RELIEF: A structured multivariate approach for removal of latent inter-scanner effects.RELIEF:一种用于消除潜在扫描仪间效应的结构化多变量方法。
Imaging Neurosci (Camb). 2023 Aug 30;1:1-16. doi: 10.1162/imag_a_00011. eCollection 2023 Aug 1.
8
Missing data in multi-omics integration: Recent advances through artificial intelligence.多组学整合中的缺失数据:通过人工智能取得的最新进展
Front Artif Intell. 2023 Feb 9;6:1098308. doi: 10.3389/frai.2023.1098308. eCollection 2023.
9
Bayesian Distance Weighted Discrimination.贝叶斯距离加权判别法
J Comput Graph Stat. 2022;31(4):1177-1188. doi: 10.1080/10618600.2022.2069778. Epub 2022 May 26.
10
BIDIMENSIONAL LINKED MATRIX FACTORIZATION FOR PAN-OMICS PAN-CANCER ANALYSIS.用于泛组学全癌分析的二维链接矩阵分解
Ann Appl Stat. 2022 Mar;16(1):193-215. doi: 10.1214/21-AOAS1495. Epub 2022 Mar 28.

本文引用的文献

1
Structural learning and integrative decomposition of multi-view data.多视图数据的结构学习与整合分解
Biometrics. 2019 Dec;75(4):1121-1132. doi: 10.1111/biom.13108. Epub 2019 Sep 15.
2
Linked matrix factorization.链接矩阵分解
Biometrics. 2019 Jun;75(2):582-592. doi: 10.1111/biom.13010. Epub 2019 Apr 2.
3
Generalized integrative principal component analysis for multi-type data with block-wise missing structure.广义整合主成分分析在具有分块缺失结构的多类型数据中的应用。
Biostatistics. 2020 Apr 1;21(2):302-318. doi: 10.1093/biostatistics/kxy052.
4
An Integrated TCGA Pan-Cancer Clinical Data Resource to Drive High-Quality Survival Outcome Analytics.TCGA 泛癌临床数据资源整合,推动高质量生存预后分析。
Cell. 2018 Apr 5;173(2):400-416.e11. doi: 10.1016/j.cell.2018.02.052.
5
Comprehensive analysis of normal adjacent to tumor transcriptomes.肿瘤相邻正常组织转录组的综合分析
Nat Commun. 2017 Oct 20;8(1):1077. doi: 10.1038/s41467-017-01027-z.
6
Prediction With Dimension Reduction of Multiple Molecular Data Sources for Patient Survival.利用多分子数据源降维预测患者生存率
Cancer Inform. 2017 Jul 11;16:1176935117718517. doi: 10.1177/1176935117718517. eCollection 2017.
7
Incorporating covariates into integrated factor analysis of multi-view data.将协变量纳入多视图数据的综合因子分析
Biometrics. 2017 Dec;73(4):1433-1442. doi: 10.1111/biom.12698. Epub 2017 Apr 13.
8
R.JIVE for exploration of multi-source molecular data.用于多源分子数据探索的R.JIVE
Bioinformatics. 2016 Sep 15;32(18):2877-9. doi: 10.1093/bioinformatics/btw324. Epub 2016 Jun 6.
9
Integrative clustering of high-dimensional data with joint and individual clusters.具有联合和单独聚类的高维数据的整合聚类
Biostatistics. 2016 Jul;17(3):537-48. doi: 10.1093/biostatistics/kxw005. Epub 2016 Feb 24.
10
Integrative approaches for large-scale transcriptome-wide association studies.大规模全转录组关联研究的综合方法
Nat Genet. 2016 Mar;48(3):245-52. doi: 10.1038/ng.3506. Epub 2016 Feb 8.