Improved detection of gene fusions by applying statistical methods reveals oncogenic RNA cancer drivers.

Author information

Department of Biochemistry, Stanford University, Stanford, CA 94305.

Department of Biomedical Data Science, Stanford University, Stanford, CA 94305.

Publication information

Proc Natl Acad Sci U S A. 2019 Jul 30;116(31):15524-15533. doi: 10.1073/pnas.1900391116. Epub 2019 Jul 15.

DOI:10.1073/pnas.1900391116

PMID:31308241

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC6681709/

Abstract

The extent to which gene fusions function as drivers of cancer remains a critical open question. Current algorithms do not sufficiently identify false-positive fusions arising during library preparation, sequencing, and alignment. Here, we introduce Data-Enriched Efficient PrEcise STatistical fusion detection (DEEPEST), an algorithm that uses statistical modeling to minimize false-positives while increasing the sensitivity of fusion detection. In 9,946 tumor RNA-sequencing datasets from The Cancer Genome Atlas (TCGA) across 33 tumor types, DEEPEST identifies 31,007 fusions, 30% more than identified by other methods, while calling 10-fold fewer false-positive fusions in nontransformed human tissues. We leverage the increased precision of DEEPEST to discover fundamental cancer biology. Namely, 888 candidate oncogenes are identified based on overrepresentation in DEEPEST calls, and 1,078 previously unreported fusions involving long intergenic noncoding RNAs, demonstrating a previously unappreciated prevalence and potential for function. DEEPEST also reveals a high enrichment for fusions involving oncogenes in cancers, including ovarian cancer, which has had minimal treatment advances in recent decades, finding that more than 50% of tumors harbor gene fusions predicted to be oncogenic. Specific protein domains are enriched in DEEPEST calls, indicating a global selection for fusion functionality: kinase domains are nearly 2-fold more enriched in DEEPEST calls than expected by chance, as are domains involved in (anaerobic) metabolism and DNA binding. The statistical algorithms, population-level analytic framework, and the biological conclusions of DEEPEST call for increased attention to gene fusions as drivers of cancer and for future research into using fusions for targeted therapy.
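The abstract reports enrichment relative to chance, for example kinase domains at nearly 2-fold. As a rough illustration of what such a statement measures, the sketch below computes a fold enrichment and an exact one-sided binomial tail probability. This is a generic textbook calculation, not the statistical procedure used by DEEPEST, and the function names, counts, and background rate are all hypothetical placeholders rather than values from the paper.

```python
from math import comb


def fold_enrichment(observed: int, total: int, background_rate: float) -> float:
    """Observed fraction divided by the fraction expected by chance."""
    return (observed / total) / background_rate


def binomial_upper_tail(observed: int, total: int, background_rate: float) -> float:
    """P(X >= observed) for X ~ Binomial(total, background_rate)."""
    p = background_rate
    return sum(
        comb(total, k) * p**k * (1 - p) ** (total - k)
        for k in range(observed, total + 1)
    )


# Hypothetical numbers, for illustration only (not taken from the paper).
n_fusions = 200        # assumed number of fusion calls examined
n_with_domain = 20     # assumed number of calls carrying the domain
chance_rate = 0.05     # assumed background frequency of the domain

fold = fold_enrichment(n_with_domain, n_fusions, chance_rate)
p_value = binomial_upper_tail(n_with_domain, n_fusions, chance_rate)

print(f"fold enrichment: {fold:.2f}")
print(f"one-sided binomial p-value: {p_value:.3g}")
```

With these placeholder inputs the observed fraction is twice the assumed background rate. A binomial model treats each call as independent with a fixed background rate, which is a simplification; a real analysis would need a carefully justified null model.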
