BIDIMENSIONAL LINKED MATRIX FACTORIZATION FOR PAN-OMICS PAN-CANCER ANALYSIS.

Authors

Lock Eric F, Park Jun Young, Hoadley Katherine A

Affiliations

Division of Biostatistics, School of Public Health, University of Minnesota.

Department of Statistical Sciences, Faculty of Arts & Science, University of Toronto.

Publication information

Ann Appl Stat. 2022 Mar;16(1):193-215. doi: 10.1214/21-AOAS1495. Epub 2022 Mar 28.

DOI: 10.1214/21-AOAS1495
PMID: 35505906
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC9060567/
Abstract

Several modern applications require the integration of multiple large data matrices that have shared rows and/or columns. For example, cancer studies that integrate multiple omics platforms across multiple types of cancer have extended our knowledge of molecular heterogeneity beyond what was observed in single-tumor and single-platform studies. However, these studies have been limited by available statistical methodology. We propose a flexible approach to the simultaneous factorization and decomposition of variation across such matrices, BIDIFAC+. BIDIFAC+ decomposes variation into a series of low-rank components that may be shared across any number of row sets (e.g., omics platforms) or column sets (e.g., cancer types). This builds on a growing literature for the factorization and decomposition of linked matrices, which has primarily focused on multiple matrices that are linked in one dimension (rows or columns) only. Our objective function extends nuclear norm penalization, is motivated by random matrix theory, gives a unique decomposition under relatively mild conditions, and can be shown to give the mode of a Bayesian posterior distribution. We apply BIDIFAC+ to pan-omics pan-cancer data from TCGA, identifying shared and specific modes of variability across different omics platforms and 29 different cancer types.
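
The abstract states that the objective function extends nuclear norm penalization. As a rough illustration of that single-matrix building block only (not the BIDIFAC+ algorithm itself), the sketch below solves min_S 0.5*||X - S||_F^2 + lam*||S||_* by soft-thresholding singular values, which is the standard closed-form solution. The function name `soft_threshold_svd`, the simulated data, and the penalty value lam = sqrt(m) + sqrt(n) are assumptions made for this example; that value is one common choice suggested by random matrix theory for unit-variance noise and is not taken from the abstract.

```python
import numpy as np

def soft_threshold_svd(X, lam):
    """Solve min_S 0.5*||X - S||_F^2 + lam*||S||_* (hypothetical helper).

    The minimizer keeps the singular vectors of X and shrinks each
    singular value toward zero by lam, truncating at zero.
    """
    U, d, Vt = np.linalg.svd(X, full_matrices=False)
    d_shrunk = np.maximum(d - lam, 0.0)
    return (U * d_shrunk) @ Vt, d_shrunk

# Simulated data (an assumption for illustration): rank-2 signal plus unit-variance noise.
rng = np.random.default_rng(0)
m, n, r = 60, 40, 2
signal = 3.0 * rng.normal(size=(m, r)) @ rng.normal(size=(r, n))
X = signal + rng.normal(size=(m, n))

# Assumed penalty: roughly the largest singular value expected from pure noise.
lam = np.sqrt(m) + np.sqrt(n)

S_hat, d_shrunk = soft_threshold_svd(X, lam)
est_rank = int(np.sum(d_shrunk > 0))
print("estimated rank of the low-rank component:", est_rank)
```

Because the penalty sits near the noise level, singular values attributable to noise tend to be shrunk to zero while the strong signal directions survive, so the estimate is low rank without the rank being fixed in advance.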
