Bioinformatics Division, Center for Synthetic and Systems Biology, Tsinghua National Laboratory for Information Science and Technology, Tsinghua University, Beijing, China.
BMC Genomics. 2012 Aug 28;13:433. doi: 10.1186/1471-2164-13-433.
Ribosomal protein genes (RPGs) are essential, tightly regulated, and highly expressed during embryonic development and cell growth. Although their protein sequences are strongly conserved, the mechanisms that regulate them are not conserved across yeast, Drosophila, and vertebrates. A recent investigation of genomic sequences that are conserved across nematode species and associated with different gene groups indicated that several elements exist in the upstream regions of C. elegans RPGs, providing new insight into the regulation of these genes in C. elegans.
In this study, we performed an in-depth examination of C. elegans RPG regulation and, using the motif discovery algorithm DME, found nine highly conserved motifs in the upstream regions of C. elegans RPGs. Four motifs were partially similar to transcription factor binding sites from C. elegans, Drosophila, yeast, and human. One pair of these motifs was found to co-occur in the upstream regions of 250 transcripts, including 22 RPGs. The distance between the two motifs displayed a complex frequency pattern that was related to their relative orientation. We tested the impact of three of these motifs on the expression of rpl-2 using a series of reporter gene constructs and showed that all three motifs are necessary to maintain the high natural expression level of this gene. One of the motifs was similar to the binding site of an orthologue of POP-1, and we showed that RNAi knockdown of pop-1 affects the expression of rpl-2. We further determined the transcription start site of rpl-2 by 5' RACE and found that the motifs lie 40-90 bases upstream of the start site. We also found evidence that a noncoding RNA, contained within the outron of rpl-2, is co-transcribed with rpl-2 and cleaved during trans-splicing.
Our results indicate that C. elegans RPGs are regulated by a complex, novel series of regulatory elements that is evolutionarily distinct from those of all other species examined to date.
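The motif-pair analysis summarized above (co-occurrence of two motifs in upstream regions, with the distance between them tallied by relative orientation) can be illustrated with a minimal sketch. This is not the authors' pipeline and does not reproduce DME. It assumes exact string matching on both strands, and the motif sequences, function names, and example upstream sequences are hypothetical placeholders chosen only for illustration.

```python
from collections import Counter

# Translation table for complementing DNA bases.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def find_sites(seq, motif):
    """Return sorted (start, strand) tuples for exact motif matches on either strand."""
    sites = []
    rc = reverse_complement(motif)
    for strand, pattern in (("+", motif), ("-", rc)):
        if strand == "-" and rc == motif:
            continue  # palindromic motif: avoid counting each site twice
        start = seq.find(pattern)
        while start != -1:
            sites.append((start, strand))
            start = seq.find(pattern, start + 1)
    return sorted(sites)


def pair_distances(seq, motif_a, motif_b):
    """Return (distance, orientation) for every site pair in one sequence.

    distance is start_of_B minus start_of_A; orientation is "same" when
    both sites lie on the same strand and "opposite" otherwise.
    """
    pairs = []
    for pos_a, strand_a in find_sites(seq, motif_a):
        for pos_b, strand_b in find_sites(seq, motif_b):
            orientation = "same" if strand_a == strand_b else "opposite"
            pairs.append((pos_b - pos_a, orientation))
    return pairs


def distance_histogram(upstream_seqs, motif_a, motif_b):
    """Tally (distance, orientation) pairs over a collection of upstream sequences."""
    hist = Counter()
    for seq in upstream_seqs:
        hist.update(pair_distances(seq.upper(), motif_a, motif_b))
    return hist


if __name__ == "__main__":
    # Hypothetical motifs and upstream sequences, not taken from the study.
    motif_a, motif_b = "GATTACA", "CTTGCA"
    upstream = ["TTGATTACACCCCCCTTGCATT", "GATTACAAAATGCAAG"]
    for key, count in sorted(distance_histogram(upstream, motif_a, motif_b).items()):
        print(key, count)
```

A real analysis would more likely score sites against position weight matrices than require exact matches, and would compare the observed distance distribution with a background set of upstream regions before calling any pattern meaningful.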