Integrative modeling of multi-omics data to identify cancer drivers and infer patient-specific gene activity.

Author information

Pavel Ana B, Sonkin Dmitriy, Reddy Anupama

Affiliations

Graduate Program in Bioinformatics, Boston University, 24 Cummington Mall, Boston, 02215, MA, USA.

Section of Computational Biomedicine, Boston University School of Medicine, 72 East Concord Street, Boston, 02118, MA, USA.

Publication information

BMC Syst Biol. 2016 Feb 11;10:16. doi: 10.1186/s12918-016-0260-9.

DOI:10.1186/s12918-016-0260-9

PMID:26864072

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC4750289/

Abstract

BACKGROUND

High-throughput technologies have been used to profile genes along many dimensions, such as genetic variation, copy number, gene and protein expression, epigenetics, and metabolomics. Computational analyses often treat these data types as independent, which leads to an explosion in the number of features, leaves studies under-powered, and, more importantly, fails to provide a comprehensive view of a gene's state. We sought to infer gene activity by integrating these dimensions using biological knowledge of oncogenes and tumor suppressors.

RESULTS

This paper proposes an integrative model of oncogene and tumor suppressor activity in cells, which is used to identify cancer drivers and compute patient-specific gene activity scores. We have developed a Fuzzy Logic Modeling (FLM) framework to combine biological knowledge with multi-omics data such as somatic mutation, gene expression, and copy number measurements. The advantage of a fuzzy logic approach is that it abstracts meaningful biological rules from low-level numerical data. Biological knowledge is often qualitative, so combining it with quantitative measurements may yield new biological insights about a gene's state. We show that the oncogenic state of a gene, or its altered tumor-suppressing state, is better characterized by integrating different molecular measurements with biological knowledge than by any single data type alone. We validate the gene activity score using data from the Cancer Cell Line Encyclopedia and drug sensitivity data for five compounds: BYL719 (PIK3CA inhibitor), PLX4720 (BRAF inhibitor), AZD6244 (MEK inhibitor), Erlotinib (EGFR inhibitor), and Nutlin-3 (MDM2 inhibitor). For the known targets of these compounds, the integrative score predicts drug sensitivity better than each data type alone. The gene activity scores are also used to cluster colorectal cancer (CRC) cell lines. Two CRC subtypes were found, and potential cancer drivers and therapeutic targets were identified for each subtype.

CONCLUSIONS

We propose a fuzzy logic-based approach to infer gene activity in cancer by integrating numerical data with descriptive biological knowledge. We compute general patient-specific gene-level scores that are useful for determining the oncogenic or tumor suppressor status of cancer driver genes and for clustering or classifying patients.
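
To make the idea of fuzzy-logic integration concrete, the following is a minimal sketch and not the authors' FLM implementation. The abstract does not give the membership functions or rules, so everything here is an assumption chosen for illustration: the function names (`ramp`, `oncogene_activity`, `suppressor_loss`), the thresholds, the two rules per gene class, and the use of the standard Zadeh operators (min for AND, max for OR).

```python
def ramp(x, lo, hi):
    """Piecewise-linear fuzzy membership: 0 at or below lo, 1 at or above hi."""
    if x <= lo:
        return 0.0
    if x >= hi:
        return 1.0
    return (x - lo) / (hi - lo)


def oncogene_activity(expr_z, cn_log2, activating_mut):
    """Hypothetical oncogene activity score in [0, 1].

    expr_z:          expression z-score (assumed input scale)
    cn_log2:         copy-number log2 ratio (assumed input scale)
    activating_mut:  True if an activating mutation is present
    """
    high_expr = ramp(expr_z, 0.0, 2.0)    # "expression is high" (assumed thresholds)
    amplified = ramp(cn_log2, 0.0, 1.0)   # "gene is amplified" (assumed thresholds)
    mutated = 1.0 if activating_mut else 0.0
    # Illustrative rules:
    #   R1: IF activating mutation THEN active
    #   R2: IF amplified AND highly expressed THEN active
    # AND is taken as min, OR across rules as max.
    return max(mutated, min(amplified, high_expr))


def suppressor_loss(expr_z, cn_log2, lof_mut):
    """Hypothetical score in [0, 1] for loss of tumor suppressor function."""
    low_expr = ramp(-expr_z, 0.0, 2.0)    # "expression is low"
    deleted = ramp(-cn_log2, 0.0, 1.0)    # "gene is deleted"
    mutated = 1.0 if lof_mut else 0.0
    # R1: IF loss-of-function mutation THEN inactivated
    # R2: IF deleted AND lowly expressed THEN inactivated
    return max(mutated, min(deleted, low_expr))


if __name__ == "__main__":
    # Made-up example profiles, not data from the study.
    samples = {
        "line_A": (2.5, 1.2, False),
        "line_B": (1.0, 0.5, False),
        "line_C": (-1.0, 0.0, True),
    }
    for name, (z, cn, mut) in samples.items():
        print(name, oncogene_activity(z, cn, mut))
```

The point of the sketch is that a qualitative rule such as "amplified and highly expressed" becomes a graded score rather than a hard threshold, so a sample with moderate amplification and moderate overexpression receives an intermediate activity value instead of being dropped.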
