Lian Zheng, Karpikov Alexander, Lian Jin, Mahajan Milind C, Hartman Stephen, Gerstein Mark, Snyder Michael, Weissman Sherman M
Department of Genetics, Yale University School of Medicine, New Haven, CT 06520-8005, USA.
Genome Res. 2008 Aug;18(8):1224-37. doi: 10.1101/gr.075804.107. Epub 2008 May 16.
Genomic analyses have been applied extensively to analyze the process of transcription initiation in mammalian cells, but less to transcript 3' end formation and transcription termination. We used a novel approach to prepare 3' end fragments from polyadenylated RNA, and mapped the position of the poly(A) addition site using oligonucleotide arrays tiling 1% of the human genome. This approach revealed more 3' ends than had been annotated. The distribution of these ends relative to RNA polymerase II (PolII) and di- and trimethylated lysine 4 and lysine 36 of histone H3 was compared. A substantial fraction of unannotated 3' ends of RNA are intronic and antisense to the embedding gene. Poly(A) ends of annotated messages lie on average 2 kb upstream of the end of PolII binding (termination). Near the termination sites, and in some internal sites, unphosphorylated and C-terminal domain (CTD) serine 2 phosphorylated PolII (POLR2A) accumulate, suggesting pausing of the polymerase and perhaps dephosphorylation prior to release. Lysine 36 trimethylation occurs across transcribed genes, sometimes alternating with stretches of DNA in which lysine 36 dimethylation is more prominent. Lysine 36 methylation decreases at or near the site of polyadenylation, sometimes disappearing before disappearance of phosphorylated RNA PolII or release of PolII from DNA. Our results suggest that transcription termination loss of histone 3 lysine 36 methylation and later release of RNA polymerase. The latter is often associated with polymerase pausing. Overall, our study reveals extensive sites of poly(A) addition and provides insights into the events that occur during 3' end formation.
基因组分析已被广泛应用于分析哺乳动物细胞中的转录起始过程,但在转录本3'端形成和转录终止方面的应用较少。我们采用了一种新方法从多聚腺苷酸化RNA制备3'端片段,并使用覆盖人类基因组1%的寡核苷酸阵列来定位多聚(A)加尾位点的位置。这种方法揭示的3'端比已注释的更多。我们比较了这些末端相对于RNA聚合酶II(PolII)以及组蛋白H3赖氨酸4和赖氨酸36的二甲基化和三甲基化的分布情况。相当一部分未注释的RNA 3'端位于内含子中且与嵌入基因呈反义关系。已注释转录本的多聚(A)末端平均位于PolII结合末端(终止)上游2 kb处。在终止位点附近以及一些内部位点,未磷酸化的和C末端结构域(CTD)丝氨酸2磷酸化的PolII(POLR2A)会积累,这表明聚合酶会暂停,并且在释放之前可能会发生去磷酸化。赖氨酸36三甲基化发生在整个转录基因上,有时会与赖氨酸36二甲基化更突出的DNA片段交替出现。赖氨酸36甲基化在多聚腺苷酸化位点或其附近会减少,有时在磷酸化的RNA PolII消失或PolII从DNA上释放之前就会消失。我们的结果表明,转录终止与组蛋白3赖氨酸36甲基化的丧失以及随后RNA聚合酶的释放有关。后者通常与聚合酶暂停相关。总体而言,我们的研究揭示了广泛的多聚(A)加尾位点,并深入了解了3'端形成过程中发生的事件。