Suppr
超能文献

整合突变和基因表达横断面数据以推断癌症进展。

Integrating mutation and gene expression cross-sectional data to infer cancer progression.

作者信息

Fleck Julia L, Pavel Ana B, Cassandras Christos G

机构信息

Division of Systems Engineering, Boston University, 15 Saint Mary's Street, Brookline, MA 02446, USA.

Graduate Program in Bioinformatics, Boston University, 24 Cummington Mall, Boston, MA 02215, USA.

出版信息

BMC Syst Biol. 2016 Jan 25;10:12. doi: 10.1186/s12918-016-0255-6.

DOI:10.1186/s12918-016-0255-6

PMID:26810975

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4727329/

Abstract

BACKGROUND

A major problem in identifying the best therapeutic targets for cancer is the molecular heterogeneity of the disease. Cancer is often caused by an accumulation of mutations which produce irreversible damage to the cell's control mechanisms of survival and proliferation. Different mutations may affect these cellular anachronisms through a combination of molecular interactions which may be dynamically changing during cancer progression. It has been previously shown that cancer accumulates mutations over time. In this paper we address the problem of cancer heterogeneity by modeling cancer progression using somatic mutation and gene expression cross-sectional data.

RESULTS

We propose a novel formulation of integrating somatic mutation and gene expression data to infer the temporal sequence of events from cross-sectional data. Using a mixed integer linear program we model the interaction between groups of different mutated genes and the resulting modifications at the gene expression level. Our approach identifies a partition of mutation events which gradually produce gene expression changes to a partition of genes over time. The proposed formulation is tested using both simulated data and real breast cancer data with matched somatic mutations and gene expression measurements from The Cancer Genome Atlas. First, we classify the genes as oncogenes or tumor suppressors based on the frequency of driver mutations. As expected, the most frequently mutated genes in breast cancer are PIK3CA and TP53 genes. Then, we select those genes with most frequent driver mutations and a set of genes known to play roles in cancer development. Furthermore, we apply the proposed mixed integer linear program to identify the temporal order in which genes mutate and, simultaneously, the changes they produce at the gene expression level during cancer progression. In addition, we are able to identify known causal relationships between mutations and gene expression changes in PI3K/AKT and TP53 pathways.

CONCLUSIONS

This paper proposes a new model to infer the temporal sequence in which mutations occur and lead to changes at the gene expression level during cancer progression. The approach is general and can be applied to any data sets with available somatic mutations and gene expression measurements.

摘要

背景

确定癌症最佳治疗靶点的一个主要问题是该疾病的分子异质性。癌症通常由突变积累引起，这些突变会对细胞的生存和增殖控制机制造成不可逆的损害。不同的突变可能通过分子相互作用的组合影响这些细胞异常现象，而这些相互作用在癌症进展过程中可能会动态变化。先前已表明癌症会随着时间积累突变。在本文中，我们通过使用体细胞突变和基因表达横断面数据对癌症进展进行建模，来解决癌症异质性问题。

结果

我们提出了一种整合体细胞突变和基因表达数据的新方法，以从横断面数据推断事件的时间顺序。使用混合整数线性规划，我们对不同突变基因组之间的相互作用以及基因表达水平上产生的修饰进行建模。我们的方法确定了突变事件的一个划分，随着时间的推移，这些事件会逐渐使基因表达发生变化，形成基因的一个划分。使用来自癌症基因组图谱的匹配体细胞突变和基因表达测量数据，对模拟数据和真实乳腺癌数据进行了测试。首先，我们根据驱动突变的频率将基因分类为癌基因或肿瘤抑制基因。不出所料，乳腺癌中最常发生突变的基因是PIK3CA和TP53基因。然后，我们选择那些具有最频繁驱动突变的基因以及一组已知在癌症发展中起作用的基因。此外，我们应用所提出的混合整数线性规划来确定基因发生突变的时间顺序，以及它们在癌症进展过程中在基因表达水平上产生的变化。此外，我们能够识别PI3K/AKT和TP53途径中突变与基因表达变化之间已知的因果关系。

结论

本文提出了一种新模型，用于推断癌症进展过程中突变发生并导致基因表达水平变化的时间顺序。该方法具有通用性，可应用于任何具有可用体细胞突变和基因表达测量数据的数据集。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c365/4727329/bf84e9c9aa62/12918_2016_255_Fig1_HTML.jpg

相似文献

Integrating mutation and gene expression cross-sectional data to infer cancer progression.

BMC Syst Biol. 2016 Jan 25;10:12. doi: 10.1186/s12918-016-0255-6.

Simultaneous inference of cancer pathways and tumor progression from cross-sectional mutation data.

J Comput Biol. 2015 Jun;22(6):510-27. doi: 10.1089/cmb.2014.0161. Epub 2015 Mar 18.

Identification of mutated core cancer modules by integrating somatic mutation, copy number variation, and gene expression data.

BMC Syst Biol. 2013;7 Suppl 2(Suppl 2):S4. doi: 10.1186/1752-0509-7-S2-S4. Epub 2013 Oct 14.

Integrative modeling of multi-omics data to identify cancer drivers and infer patient-specific gene activity.

BMC Syst Biol. 2016 Feb 11;10:16. doi: 10.1186/s12918-016-0260-9.

Integration of somatic mutation, expression and functional data reveals potential driver genes predictive of breast cancer survival.

Bioinformatics. 2015 Aug 15;31(16):2607-13. doi: 10.1093/bioinformatics/btv164. Epub 2015 Mar 24.

TP53 mutations, expression and interaction networks in human cancers.

Oncotarget. 2017 Jan 3;8(1):624-643. doi: 10.18632/oncotarget.13483.

Frequent mutations in acetylation and ubiquitination sites suggest novel driver mechanisms of cancer.

Genome Med. 2016 May 12;8(1):55. doi: 10.1186/s13073-016-0311-2.

HSP27 expression in primary colorectal cancers is dependent on mutation of KRAS and PI3K/AKT activation status and is independent of TP53.

Exp Mol Pathol. 2013 Feb;94(1):103-8. doi: 10.1016/j.yexmp.2012.09.001. Epub 2012 Sep 12.

Mutational characterization of individual breast tumors: TP53 and PI3K pathway genes are frequently and distinctively mutated in different subtypes.

Breast Cancer Res Treat. 2012 Feb;132(1):29-39. doi: 10.1007/s10549-011-1518-y. Epub 2011 Apr 22.

ZDOG: zooming in on dominating genes with mutations in cancer pathways.

BMC Bioinformatics. 2019 Dec 30;20(1):740. doi: 10.1186/s12859-019-3326-z.

引用本文的文献

SNPs-Panel Polymorphism Variations in and Genes Are Not Associated with Prostate Cancer.

Biomedicines. 2023 Dec 11;11(12):3276. doi: 10.3390/biomedicines11123276.

Combining bulk and single-cell RNA-sequencing data to develop an NK cell-related prognostic signature for hepatocellular carcinoma based on an integrated machine learning framework.

Eur J Med Res. 2023 Aug 30;28(1):306. doi: 10.1186/s40001-023-01300-6.

Acute Myeloid Leukemia Expresses a Specific Group of Olfactory Receptors.

Cancers (Basel). 2023 Jun 6;15(12):3073. doi: 10.3390/cancers15123073.

Mathematical modeling the order of driver gene mutations in colorectal cancer.

PLoS Comput Biol. 2023 Jun 27;19(6):e1011225. doi: 10.1371/journal.pcbi.1011225. eCollection 2023 Jun.

Construction and comprehensive analysis of a novel prognostic signature associated with pyroptosis molecular subtypes in patients with pancreatic adenocarcinoma.

Front Immunol. 2023 Feb 3;14:1111494. doi: 10.3389/fimmu.2023.1111494. eCollection 2023.

An Analysis of Transcriptomic Burden Identifies Biological Progression Roadmaps for Hematological Malignancies and Solid Tumors.

Biomedicines. 2022 Oct 27;10(11):2720. doi: 10.3390/biomedicines10112720.

Predicting drug sensitivity of cancer cells based on DNA methylation levels.

PLoS One. 2021 Sep 10;16(9):e0238757. doi: 10.1371/journal.pone.0238757. eCollection 2021.

Identification of Breast Cancer Subtype-Specific Biomarkers by Integrating Copy Number Alterations and Gene Expression Profiles.

Medicina (Kaunas). 2021 Mar 12;57(3):261. doi: 10.3390/medicina57030261.

Feature Selection for Breast Cancer Classification by Integrating Somatic Mutation and Gene Expression.

Front Genet. 2021 Feb 26;12:629946. doi: 10.3389/fgene.2021.629946. eCollection 2021.

Derivation and Application of Molecular Signatures to Prostate Cancer: Opportunities and Challenges.

Cancers (Basel). 2021 Jan 28;13(3):495. doi: 10.3390/cancers13030495.

本文引用的文献

Simultaneous inference of cancer pathways and tumor progression from cross-sectional mutation data.

J Comput Biol. 2015 Jun;22(6):510-27. doi: 10.1089/cmb.2014.0161. Epub 2015 Mar 18.

Inferring tree causal models of cancer progression with probability raising.

PLoS One. 2014 Oct 9;9(10):e108358. doi: 10.1371/journal.pone.0108358. eCollection 2014.

Comprehensive molecular profiling of lung adenocarcinoma.

Nature. 2014 Jul 31;511(7511):543-50. doi: 10.1038/nature13385. Epub 2014 Jul 9.

Survival from breast cancer in patients with CHEK2 mutations.

Breast Cancer Res Treat. 2014 Apr;144(2):397-403. doi: 10.1007/s10549-014-2865-2. Epub 2014 Feb 21.

Prognostic role of mutation in patients with triple-negative breast cancer.

Oncol Lett. 2014 Jan;7(1):278-284. doi: 10.3892/ol.2013.1684. Epub 2013 Nov 14.

Data, information, knowledge and principle: back to metabolism in KEGG.

Nucleic Acids Res. 2014 Jan;42(Database issue):D199-205. doi: 10.1093/nar/gkt1076. Epub 2013 Nov 7.

The Cancer Genome Atlas Pan-Cancer analysis project.

Nat Genet. 2013 Oct;45(10):1113-20. doi: 10.1038/ng.2764.

Cancer genome landscapes.

Science. 2013 Mar 29;339(6127):1546-58. doi: 10.1126/science.1235122.

Comprehensive molecular portraits of human breast tumours.

Nature. 2012 Oct 4;490(7418):61-70. doi: 10.1038/nature11412. Epub 2012 Sep 23.

PARADIGM-SHIFT predicts the function of mutations in multiple cancers using pathway impact analysis.

Bioinformatics. 2012 Sep 15;28(18):i640-i646. doi: 10.1093/bioinformatics/bts402.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

Suppr超能文献

整合突变和基因表达横断面数据以推断癌症进展。

Integrating mutation and gene expression cross-sectional data to infer cancer progression.

作者信息

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSIONS

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译