Department of Computational and Systems Biology, University of Pittsburgh School of Medicine, Pittsburgh, PA 15260, USA.
Nucleic Acids Res. 2012 Sep 1;40(17):8460-71. doi: 10.1093/nar/gks637. Epub 2012 Jun 29.
We present a comprehensive map of over 1 million polyadenylation sites and quantify their usage in major cancers and tumor cell lines using direct RNA sequencing. We built the Expression and Polyadenylation Database to enable the visualization of the polyadenylation maps in various cancers and to facilitate the discovery of novel genes and gene isoforms that are potentially important to tumorigenesis. Analyses of polyadenylation sites indicate that a large fraction (∼30%) of mRNAs contain alternative polyadenylation sites in their 3' untranslated regions, independent of the cell type. The shortest 3' untranslated region isoforms are preferentially upregulated in cancer tissues, genome-wide. Candidate targets of alternative polyadenylation-mediated upregulation of short isoforms include POLR2K, and signaling cascades of cell-cell and cell-extracellular matrix contact, particularly involving regulators of Rho GTPases. Polyadenylation maps also helped to improve 3' untranslated region annotations and identify candidate regulatory marks such as sequence motifs, H3K36Me3 and Pabpc1 that are isoform dependent and occur in a position-specific manner. In summary, these results highlight the need to go beyond monitoring only the cumulative transcript levels for a gene, to separately analysing the expression of its RNA isoforms.
我们呈现了一个超过 100 万个多聚腺苷酸化位点的综合图谱,并使用直接 RNA 测序来量化它们在主要癌症和肿瘤细胞系中的使用情况。我们构建了 Expression and Polyadenylation Database,以便能够在各种癌症中可视化多聚腺苷酸化图谱,并促进对肿瘤发生具有潜在重要性的新基因和基因亚型的发现。多聚腺苷酸化位点的分析表明,在 3'非翻译区中,很大一部分(约 30%)的 mRNA 含有细胞类型独立的选择性多聚腺苷酸化位点。在癌症组织中,最短的 3'非翻译区亚型在全基因组范围内优先上调。选择性多聚腺苷酸化介导的短亚型上调的候选靶标包括 POLR2K,以及细胞-细胞和细胞-细胞外基质接触的信号级联反应,特别是涉及 Rho GTPases 的调节剂。多聚腺苷酸化图谱还有助于改进 3'非翻译区注释,并鉴定候选调控标记,如序列基序、H3K36Me3 和 Pabpc1,这些标记与亚型有关,并且以特定位置的方式发生。总之,这些结果强调需要超越仅监测基因的累积转录水平,而是分别分析其 RNA 亚型的表达。