A genome-wide approach to link genotype to clinical outcome by utilizing next generation sequencing and gene chip data of 6,697 breast cancer patients.

Author information

Pongor Lőrinc, Kormos Máté, Hatzis Christos, Pusztai Lajos, Szabó András, Győrffy Balázs

Affiliations

MTA TTK Lendület Cancer Biomarker Research Group, Research Centre for Natural Sciences, Magyar tudósok körútja 2, Budapest, H-1117, Hungary.

2nd Department of Pediatrics, Semmelweis University, Budapest, Hungary.

Publication information

Genome Med. 2015 Oct 16;7:104. doi: 10.1186/s13073-015-0228-1.

DOI:10.1186/s13073-015-0228-1

PMID:26474971

Full text link: https://pmc.ncbi.nlm.nih.gov/articles/PMC4609150/

Abstract

BACKGROUND

The use of somatic mutations for predicting clinical outcome is difficult because a mutation can indirectly influence the function of many genes, and also because clinical follow-up is sparse in the relatively young next-generation sequencing (NGS) databanks. Here we approach this problem by linking sequence databanks to well-annotated gene chip datasets, using a multigene transcriptomic fingerprint as a link between gene mutations and gene expression in breast cancer patients.

METHODS

The database consists of 763 NGS samples containing mutational status for 22,938 genes and RNA-seq data for 10,987 genes. The gene chip database contains 5,934 patients with 10,987 genes plus clinical characteristics. For the prediction, mutations present in a sample are first translated into a 'transcriptomic fingerprint' by running ROC analysis on mutation and RNA-seq data. Then correlation to survival is assessed by computing Cox regression for both up- and downregulated signatures.
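The fingerprint step described above can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the toy data, and the AUC cutoff of 0.7 are all assumptions made for illustration. For each gene, the ROC area under the curve (AUC) measures how well expression separates mutated from wild-type samples; genes with a high AUC form an upregulated signature and genes with a low AUC form a downregulated one. A signature score per gene chip patient could then be passed to a Cox regression in a survival analysis library, which is left out here.

```python
from statistics import mean


def auc(mutant_values, wildtype_values):
    """ROC AUC: chance that a mutant sample has higher expression
    than a wild-type sample (ties count as 0.5)."""
    wins = 0.0
    for m in mutant_values:
        for w in wildtype_values:
            if m > w:
                wins += 1.0
            elif m == w:
                wins += 0.5
    return wins / (len(mutant_values) * len(wildtype_values))


def transcriptomic_fingerprint(expression, mutated, auc_cutoff=0.7):
    """Split genes into up- and downregulated signatures.

    expression: {gene: {sample_id: value}} (hypothetical RNA-seq layout)
    mutated: set of sample ids carrying the mutation of interest
    auc_cutoff: assumed threshold, not taken from the paper
    """
    up, down = [], []
    for gene, values in expression.items():
        mut = [v for s, v in values.items() if s in mutated]
        wt = [v for s, v in values.items() if s not in mutated]
        if not mut or not wt:
            continue  # AUC is undefined without both groups
        a = auc(mut, wt)
        if a >= auc_cutoff:
            up.append(gene)
        elif a <= 1 - auc_cutoff:
            down.append(gene)
    return up, down


def signature_score(chip_expression, genes, patient):
    """Mean gene chip expression of the signature genes for one patient."""
    return mean(chip_expression[g][patient] for g in genes)


# Toy RNA-seq data: samples s1 and s2 carry the mutation.
expression = {
    "GENE_A": {"s1": 9.0, "s2": 8.5, "s3": 2.0, "s4": 2.5},
    "GENE_B": {"s1": 1.0, "s2": 1.5, "s3": 7.0, "s4": 6.5},
    "GENE_C": {"s1": 5.0, "s2": 4.0, "s3": 5.0, "s4": 4.0},
}
mutated = {"s1", "s2"}
up, down = transcriptomic_fingerprint(expression, mutated)

# Toy gene chip data for two patients.
chip = {
    "GENE_A": {"p1": 8.0, "p2": 3.0},
    "GENE_B": {"p1": 2.0, "p2": 6.0},
}
up_score_p1 = signature_score(chip, up, "p1")
```

In this sketch the up- and downregulated signatures are scored separately, mirroring the abstract's statement that Cox regression is computed for both.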

RESULTS

According to this approach, the top driver oncogenes having a mutation prevalence over 5% included AKT1, TRANK1, TRAPPC10, RPGR, COL6A2, RAPGEF4, ATG2B, CNTRL, NAA38, OSBPL10, POTEF, SCLT1, SUN1, VWDE, MTUS2, and PIK3CA, and the top tumor suppressor genes included PHEX, TP53, GGA3, RGS22, PXDNL, ARFGEF1, BRCA2, CHD8, GCC2, and ARMC4. The system was validated by computing correlation between RNA-seq and microarray data (r² = 0.73, P < 1E-16). Cross-validation using 20 genes with a prevalence of approximately 5% confirmed analysis reproducibility.
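The platform comparison reported above rests on a squared correlation between paired measurements. The sketch below assumes a Pearson correlation and uses invented toy values; the abstract does not state which correlation coefficient was used, and the function name is hypothetical.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mx = sum(x) / len(x)
    my = sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


# Toy paired expression values for the same genes on both platforms.
rnaseq = [1.0, 2.0, 3.0, 4.0, 5.0]
microarray = [2.1, 3.9, 6.2, 8.0, 9.8]

r = pearson_r(rnaseq, microarray)
r_squared = r ** 2  # share of variance explained by a linear fit
```

An r² of 0.73, as reported in the abstract, would mean that about 73% of the variance in one platform's values is explained by a linear relationship with the other.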

CONCLUSIONS

We established a pipeline enabling rapid clinical validation of a discovered mutation in a large breast cancer cohort. An online interface is available for evaluating any human gene mutation or combinations of up to three such genes (http://www.g-2-o.com).

Figure 1: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b87/4609150/d25269f977f2/13073_2015_228_Fig1_HTML.jpg
