Afik Shaked, Bartok Osnat, Artyomov Maxim N, Shishkin Alexander A, Kadri Sabah, Hanan Mor, Zhu Xiaopeng, Garber Manuel, Kadener Sebastian
Biological Chemistry Department, Silberman Institute of Life Sciences, The Hebrew University, Jerusalem 91904, Israel.
Department of Pathology and Immunology, Washington University School of Medicine, St Louis, MO 63110, USA.
Nucleic Acids Res. 2017 Jun 20;45(11):e95. doi: 10.1093/nar/gkx133.
Cells regulate biological responses in part through changes in transcription start sites (TSS) or cleavage and polyadenylation sites (PAS). To fully understand gene regulatory networks, it is therefore critical to accurately annotate cell type-specific TSS and PAS. Here we present a simple and straightforward approach for genome-wide annotation of 5΄- and 3΄-RNA ends. Our approach reliably discerns bona fide PAS from false PAS that arise due to internal poly(A) tracts, a common problem with current PAS annotation methods. We applied our methodology to study the impact of temperature on the Drosophila melanogaster head transcriptome. We found hundreds of previously unidentified TSS and PAS which revealed two interesting phenomena: first, genes with multiple PASs tend to harbor a motif near the most proximal PAS, which likely represents a new cleavage and polyadenylation signal. Second, motif analysis of promoters of genes affected by temperature suggested that boundary element association factor of 32 kDa (BEAF-32) and DREF mediates a transcriptional program at warm temperatures, a result we validated in a fly line where beaf-32 is downregulated. These results demonstrate the utility of a high-throughput platform for complete experimental and computational analysis of mRNA-ends to improve gene annotation.
细胞部分通过转录起始位点(TSS)或切割及聚腺苷酸化位点(PAS)的变化来调节生物学反应。因此,要全面理解基因调控网络,准确注释细胞类型特异性的TSS和PAS至关重要。在此,我们提出了一种简单直接的方法用于全基因组范围内5′-和3′-RNA末端的注释。我们的方法能够可靠地从因内部多聚腺苷酸序列导致的假PAS中辨别出真正的PAS,这是当前PAS注释方法中常见的问题。我们应用我们的方法来研究温度对黑腹果蝇头部转录组的影响。我们发现了数百个先前未被识别的TSS和PAS,揭示了两个有趣的现象:第一,具有多个PAS的基因往往在最靠近近端的PAS附近含有一个基序,这可能代表一种新的切割及聚腺苷酸化信号。第二,对受温度影响基因的启动子进行基序分析表明,32 kDa的边界元件关联因子(BEAF-32)和DREF在温暖温度下介导了一个转录程序,我们在beaf-32下调的果蝇品系中验证了这一结果。这些结果证明了一个高通量平台在对mRNA末端进行完整实验和计算分析以改善基因注释方面的实用性。