Watson School of Biological Sciences, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA.
Genome Res. 2013 Jan;23(1):169-80. doi: 10.1101/gr.139618.112. Epub 2012 Aug 30.
Many eukaryotic genes possess multiple alternative promoters with distinct expression specificities. Therefore, comprehensively annotating promoters and deciphering their individual regulatory dynamics is critical for gene expression profiling applications and for our understanding of regulatory complexity. We introduce RAMPAGE, a novel promoter activity profiling approach that combines extremely specific 5'-complete cDNA sequencing with an integrated data analysis workflow, to address the limitations of current techniques. RAMPAGE features a streamlined protocol for fast and easy generation of highly multiplexed sequencing libraries, offers very high transcription start site specificity, generates accurate and reproducible promoter expression measurements, and yields extensive transcript connectivity information through paired-end cDNA sequencing. We used RAMPAGE in a genome-wide study of promoter activity throughout 36 stages of the life cycle of Drosophila melanogaster, and describe here a comprehensive data set that represents the first available developmental time-course of promoter usage. We found that >40% of developmentally expressed genes have at least two promoters and that alternative promoters generally implement distinct regulatory programs. Transposable elements, long proposed to play a central role in the evolution of their host genomes through their ability to regulate gene expression, contribute at least 1300 promoters shaping the developmental transcriptome of D. melanogaster. Hundreds of these promoters drive the expression of annotated genes, and transposons often impart their own expression specificity upon the genes they regulate. These observations provide support for the theory that transposons may drive regulatory innovation through the distribution of stereotyped cis-regulatory modules throughout their host genomes.
许多真核基因拥有多个具有不同表达特异性的替代启动子。因此,全面注释启动子并破译它们各自的调控动态对于基因表达谱分析应用和我们对调控复杂性的理解至关重要。我们引入了 RAMPAGE,这是一种新的启动子活性分析方法,它结合了极其特异的 5'-完整 cDNA 测序和集成数据分析工作流程,以解决当前技术的局限性。RAMPAGE 具有一个简化的方案,可快速轻松地生成高度多重化的测序文库,提供非常高的转录起始位点特异性,生成准确和可重复的启动子表达测量结果,并通过配对末端 cDNA 测序产生广泛的转录连接信息。我们在黑腹果蝇生命周期的 36 个阶段的全基因组启动子活性研究中使用了 RAMPAGE,并在此处描述了一个全面的数据集,该数据集代表了第一个可用的启动子使用发育时间过程。我们发现,超过 40%的发育表达基因至少有两个启动子,并且替代启动子通常实施不同的调控程序。转座元件长期以来一直被认为通过其调节基因表达的能力在其宿主基因组的进化中发挥核心作用,至少有 1300 个启动子塑造了黑腹果蝇的发育转录组。其中数百个启动子驱动注释基因的表达,并且转座子通常在它们调节的基因上赋予自己的表达特异性。这些观察结果为转座子可能通过在其宿主基因组中分布定型顺式调控模块来驱动调控创新的理论提供了支持。