多组学数据的同步整合提高了癌症驱动模块的识别能力。

Simultaneous Integration of Multi-omics Data Improves the Identification of Cancer Driver Modules.

机构信息

Broad Institute of Harvard and MIT, Cambridge, MA 02142, USA; Department of Pathology and Center for Cancer Research, Massachusetts General Hospital and Harvard Medical School, Boston, MA 02114, USA; Blavatnik School of Computer Science, Tel Aviv University, 69978 Tel Aviv, Israel.

Department of Data Science, Dana-Farber Cancer Institute, Boston, MA 02215, USA; Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA 02115, USA; Department of Stem Cell and Regenerative Biology, Harvard University, Cambridge, MA 02138, USA.

出版信息

Cell Syst. 2019 May 22;8(5):456-466.e5. doi: 10.1016/j.cels.2019.04.005. Epub 2019 May 15.

DOI:10.1016/j.cels.2019.04.005

PMID:31103572

Abstract

The identification of molecular pathways driving cancer progression is a fundamental challenge in cancer research. Most approaches to address it are limited in the number of data types they employ and perform data integration in a sequential manner. Here, we describe ModulOmics, a method to de novo identify cancer driver pathways, or modules, by integrating protein-protein interactions, mutual exclusivity of mutations and copy number alterations, transcriptional coregulation, and RNA coexpression into a single probabilistic model. To efficiently search and score the large space of candidate modules, ModulOmics employs a two-step optimization procedure that combines integer linear programming with stochastic search. Applied across several cancer types, ModulOmics identifies highly functionally connected modules enriched with cancer driver genes, outperforming state-of-the-art methods and demonstrating the power of using multiple omics data types simultaneously. On breast cancer subtypes, ModulOmics proposes unexplored connections supported by an independent patient cohort and independent proteomic and phosphoproteomic datasets.

摘要

鉴定驱动癌症进展的分子途径是癌症研究中的一个基本挑战。大多数解决该问题的方法在其使用的数据类型数量上都存在局限性，并且以顺序方式执行数据集成。在这里，我们描述了一种新的方法 ModulOmics，该方法通过将蛋白质-蛋白质相互作用、突变和拷贝数改变的互斥性、转录核心调控以及 RNA 共表达整合到一个单一的概率模型中，从头鉴定癌症驱动途径或模块。为了有效地搜索和评分候选模块的大型空间，ModulOmics 采用了两步优化过程，将整数线性规划与随机搜索相结合。在几种癌症类型中的应用表明，ModulOmics 鉴定出了高度功能连接的模块，其中富含癌症驱动基因，优于最先进的方法，并证明了同时使用多种组学数据类型的强大功能。在乳腺癌亚型中，ModulOmics 提出了具有独立患者队列以及独立蛋白质组学和磷酸化蛋白质组学数据集支持的探索性连接。

相似文献

Simultaneous Integration of Multi-omics Data Improves the Identification of Cancer Driver Modules.

Cell Syst. 2019 May 22;8(5):456-466.e5. doi: 10.1016/j.cels.2019.04.005. Epub 2019 May 15.

BeWith: A Between-Within method to discover relationships between cancer modules via integrated analysis of mutual exclusivity, co-occurrence and functional interactions.

PLoS Comput Biol. 2017 Oct 12;13(10):e1005695. doi: 10.1371/journal.pcbi.1005695. eCollection 2017 Oct.

Identification of mutated core cancer modules by integrating somatic mutation, copy number variation, and gene expression data.

BMC Syst Biol. 2013;7 Suppl 2(Suppl 2):S4. doi: 10.1186/1752-0509-7-S2-S4. Epub 2013 Oct 14.

Simultaneous identification of multiple driver pathways in cancer.

PLoS Comput Biol. 2013;9(5):e1003054. doi: 10.1371/journal.pcbi.1003054. Epub 2013 May 23.

The Integrative Method Based on the Module-Network for Identifying Driver Genes in Cancer Subtypes.

Molecules. 2018 Jan 24;23(2):183. doi: 10.3390/molecules23020183.

Identification of candidate cancer drivers by integrative Epi-DNA and Gene Expression (iEDGE) data analysis.

Sci Rep. 2019 Nov 15;9(1):16904. doi: 10.1038/s41598-019-52886-z.

Identifying mutated driver pathways in cancer by integrating multi-omics data.

Comput Biol Chem. 2019 Jun;80:159-167. doi: 10.1016/j.compbiolchem.2019.03.019. Epub 2019 Apr 2.

Using MEMo to discover mutual exclusivity modules in cancer.

Curr Protoc Bioinformatics. 2013 Mar;Chapter 8:8.17.1-8.17.12. doi: 10.1002/0471250953.bi0817s41.

A comparative study of multi-omics integration tools for cancer driver gene identification and tumour subtyping.

Brief Bioinform. 2020 Dec 1;21(6):1920-1936. doi: 10.1093/bib/bbz121.

CDPath: Cooperative Driver Pathways Discovery Using Integer Linear Programming and Markov Clustering.

IEEE/ACM Trans Comput Biol Bioinform. 2021 Jul-Aug;18(4):1384-1395. doi: 10.1109/TCBB.2019.2945029. Epub 2021 Aug 6.

引用本文的文献

Reproducible processing of TCGA regulatory networks.

bioRxiv. 2024 Nov 7:2024.11.05.622163. doi: 10.1101/2024.11.05.622163.

Multiview representation learning for identification of novel cancer genes and their causative biological mechanisms.

Brief Bioinform. 2024 Jul 25;25(5). doi: 10.1093/bib/bbae418.

Current and future directions in network biology.

Bioinform Adv. 2024 Aug 14;4(1):vbae099. doi: 10.1093/bioadv/vbae099. eCollection 2024.

CGMega: explainable graph neural network framework with attention mechanisms for cancer gene module dissection.

Nat Commun. 2024 Jul 17;15(1):5997. doi: 10.1038/s41467-024-50426-6.

node2vec2rank: Large Scale and Stable Graph Differential Analysis via Multi-Layer Node Embeddings and Ranking.

bioRxiv. 2024 Jun 17:2024.06.16.599201. doi: 10.1101/2024.06.16.599201.

Multi-omics data integration using ratio-based quantitative profiling with Quartet reference materials.

Nat Biotechnol. 2024 Jul;42(7):1133-1149. doi: 10.1038/s41587-023-01934-1. Epub 2023 Sep 7.

Identification of functional gene modules by integrating multi-omics data and known molecular interactions.

Front Genet. 2023 Jan 24;14:1082032. doi: 10.3389/fgene.2023.1082032. eCollection 2023.

Integrating multi-type aberrations from DNA and RNA through dynamic mapping gene space for subtype-specific breast cancer driver discovery.

PeerJ. 2023 Feb 3;11:e14843. doi: 10.7717/peerj.14843. eCollection 2023.

Multi-omics subtyping of hepatocellular carcinoma patients using a Bayesian network mixture model.

PLoS Comput Biol. 2022 Sep 6;18(9):e1009767. doi: 10.1371/journal.pcbi.1009767. eCollection 2022 Sep.

SUBATOMIC: a SUbgraph BAsed mulTi-OMIcs clustering framework to analyze integrated multi-edge networks.

BMC Bioinformatics. 2022 Sep 5;23(1):363. doi: 10.1186/s12859-022-04908-3.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

多组学数据的同步整合提高了癌症驱动模块的识别能力。

Simultaneous Integration of Multi-omics Data Improves the Identification of Cancer Driver Modules.

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献