Zhao Wei, Hoadley Katherine A, Parker Joel S, Perou Charles M
Department of Systems Biology, University of Texas MD Anderson Cancer Center, 77054, Houston, TX, USA.
Curriculum in Bioinformatics and Computational Biology, University of North Carolina, 27599, Chapel Hill, NC, USA.
BMC Genomics. 2016 Mar 3;17:181. doi: 10.1186/s12864-016-2521-9.
Alternative splicing provides a major mechanism to generate protein diversity. Increasing evidence suggests a link of dysregulation of splicing associated with cancer. While previous genomic-based studies demonstrated the expression of a handful of tumor-specific isoforms, genome-wide alterations in the balance between isoforms and cancer subtypes is understudied.
We systematically analyzed the isoform-level expression patterns and isoform switching events of 819 breast tumor and normal samples assayed by mRNA-seq from TCGA project. On average, 2.2 isoforms per gene were detected and 67.5 % of detected genes (i.e. expressed) showed 1-2 isoforms only. While the majority of isoforms for a given gene were positively correlated with each other and the overall gene level, 470 pairs of isoforms displayed an inverse correlation suggesting a switching event. Most of the isoform switching events were associated with molecular subtypes, including a Basal-like-associated switching in CTNND1. 88 genes showed switching independent of subtypes, among which the isoform pattern of PRICKLE1 was associated with a large genomic signature of biological significance.
Our results reveal that the majority of genes do not undergo complex mRNA splicing within breast cancers, and that there is a general concordance in isoform and gene expression levels in breast tumors. We identified hundreds of isoform switching events across breast tumors, most of which were associated with differences in tumor subtypes. As exemplified by the detailed analysis of CTNND1 and PRICKLE1, these isoform switching events potentially provide new insights into the post-transcriptional regulatory mechanisms of tumor subtypes and cancer biology.
可变剪接是产生蛋白质多样性的主要机制。越来越多的证据表明剪接失调与癌症有关。虽然先前基于基因组的研究证明了少数肿瘤特异性异构体的表达,但异构体平衡与癌症亚型之间的全基因组改变仍未得到充分研究。
我们系统地分析了来自TCGA项目的819个乳腺癌肿瘤和正常样本通过mRNA测序检测到的异构体水平表达模式和异构体转换事件。平均每个基因检测到2.2种异构体,67.5%的检测到的基因(即表达的基因)仅显示1 - 2种异构体。虽然给定基因的大多数异构体彼此之间以及与总体基因水平呈正相关,但470对异构体显示出负相关,表明存在转换事件。大多数异构体转换事件与分子亚型相关,包括CTNND1中与基底样相关的转换。88个基因显示出与亚型无关的转换,其中PRICKLE1的异构体模式与具有生物学意义的大基因组特征相关。
我们的结果表明,大多数基因在乳腺癌中不经历复杂的mRNA剪接,并且乳腺癌中异构体和基因表达水平总体上是一致的。我们在乳腺癌中鉴定出数百个异构体转换事件,其中大多数与肿瘤亚型的差异相关。以CTNND1和PRICKLE1的详细分析为例,这些异构体转换事件可能为肿瘤亚型的转录后调控机制和癌症生物学提供新的见解。