Sarda Shrutii, Das Avinash, Vinson Charles, Hannenhalli Sridhar
Center for Bioinformatics and Computational Biology, University of Maryland, College Park, Maryland 20742, USA.
Center for Cancer Research, National Cancer Institute, NIH, Bethesda, Maryland 20892, USA.
Genome Res. 2017 Apr;27(4):553-566. doi: 10.1101/gr.212050.116. Epub 2017 Feb 21.
DNA methylation at the promoter of a gene is presumed to render it silent, yet a sizable fraction of genes with methylated proximal promoters exhibit elevated expression. Here, we show, through extensive analysis of the methylome and transcriptome in 34 tissues, that in many such cases, transcription is initiated by a distal upstream CpG island (CGI) located several kilobases away that functions as an alternative promoter. Specifically, such genes are expressed precisely when the neighboring CGI is unmethylated but remain silenced otherwise. Based on CAGE and Pol II localization data, we found strong evidence of transcription initiation at the upstream CGI and a lack thereof at the methylated proximal promoter itself. Consistent with their alternative promoter activity, CGI-initiated transcripts are associated with signals of stable elongation and splicing that extend into the gene body, as evidenced by tissue-specific RNA-seq and other DNA-encoded splice signals. Furthermore, based on both inter- and intra-species analyses, such CGIs were found to be under greater purifying selection relative to CGIs upstream of silenced genes. Overall, our study describes a hitherto unreported conserved mechanism of transcription of genes with methylated proximal promoters in a tissue-specific fashion. Importantly, this phenomenon explains the aberrant expression patterns of some cancer driver genes, potentially due to aberrant hypomethylation of distal CGIs, despite methylation at proximal promoters.
基因启动子处的DNA甲基化被认为会使其沉默,但相当一部分近端启动子甲基化的基因却表现出表达上调。在此,通过对34种组织的甲基化组和转录组进行广泛分析,我们发现,在许多此类情况下,转录是由位于数千碱基之外的远端上游CpG岛(CGI)启动的,该CGI充当替代启动子。具体而言,此类基因恰好在相邻的CGI未甲基化时表达,否则保持沉默。基于CAGE和Pol II定位数据,我们发现了在上游CGI处存在转录起始的有力证据,而在甲基化的近端启动子本身则没有。与它们的替代启动子活性一致,CGI启动的转录本与延伸到基因体中的稳定延伸和剪接信号相关,组织特异性RNA测序和其他DNA编码的剪接信号证明了这一点。此外,基于种间和种内分析,相对于沉默基因上游的CGI,此类CGI受到更大的纯化选择。总体而言,我们的研究描述了一种迄今未报道的以组织特异性方式转录近端启动子甲基化基因的保守机制。重要的是,这种现象解释了一些癌症驱动基因的异常表达模式,这可能是由于远端CGI异常低甲基化所致,尽管近端启动子存在甲基化。