European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, UK.
ETH Zurich, Zurich, Switzerland.
Nature. 2020 Feb;578(7793):129-136. doi: 10.1038/s41586-020-1970-0. Epub 2020 Feb 5.
Transcript alterations often result from somatic changes in cancer genomes. Various forms of RNA alterations have been described in cancer, including overexpression, altered splicing and gene fusions; however, it is difficult to attribute these to underlying genomic changes owing to heterogeneity among patients and tumour types, and the relatively small cohorts of patients for whom samples have been analysed by both transcriptome and whole-genome sequencing. Here we present, to our knowledge, the most comprehensive catalogue of cancer-associated gene alterations to date, obtained by characterizing tumour transcriptomes from 1,188 donors of the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium of the International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA). Using matched whole-genome sequencing data, we associated several categories of RNA alterations with germline and somatic DNA alterations, and identified probable genetic mechanisms. Somatic copy-number alterations were the major drivers of variations in total gene and allele-specific expression. We identified 649 associations of somatic single-nucleotide variants with gene expression in cis, of which 68.4% involved associations with flanking non-coding regions of the gene. We found 1,900 splicing alterations associated with somatic mutations, including the formation of exons within introns in proximity to Alu elements. In addition, 82% of gene fusions were associated with structural variants, including 75 of a new class, termed 'bridged' fusions, in which a third genomic location bridges two genes. We observed transcriptomic alteration signatures that differ between cancer types and have associations with variations in DNA mutational signatures. This compendium of RNA alterations in the genomic context provides a rich resource for identifying genes and mechanisms that are functionally implicated in cancer.
转录本改变通常是癌症基因组体突变的结果。在癌症中已经描述了各种形式的 RNA 改变,包括过表达、剪接改变和基因融合;然而,由于患者和肿瘤类型的异质性以及通过转录组和全基因组测序分析的患者样本相对较少,因此很难将这些归因于潜在的基因组变化。在这里,我们展示了迄今为止最全面的癌症相关基因改变目录,这些改变是通过对国际癌症基因组联盟(ICGC)和癌症基因组图谱(TCGA)的全癌症分析基因组(PCAWG)联盟的 1188 名供体的肿瘤转录组进行特征分析获得的。使用匹配的全基因组测序数据,我们将几种 RNA 改变类别与种系和体细胞 DNA 改变相关联,并确定了可能的遗传机制。体细胞拷贝数改变是总基因和等位基因特异性表达变化的主要驱动因素。我们确定了 649 个与顺式基因表达相关的体细胞单核苷酸变异与基因表达的关联,其中 68.4%涉及与基因侧翼非编码区域的关联。我们发现了 1900 个与体细胞突变相关的剪接改变,包括在靠近 Alu 元件的内含子中形成外显子。此外,82%的基因融合与结构变异相关,包括 75 个新的融合类别,称为“桥接”融合,其中第三个基因组位置桥接两个基因。我们观察到不同癌症类型之间存在转录本改变特征,并与 DNA 突变特征的变化相关。这种基因组背景下的 RNA 改变综合提供了一个丰富的资源,可用于鉴定在癌症中具有功能相关性的基因和机制。