Yang Mary, Elnitski Laura
BMC Bioinformatics. 2014;15 Suppl 17(Suppl 17):S1. doi: 10.1186/1471-2105-15-S17-S1. Epub 2014 Dec 16.
The presence of bidirectional promoters in all vertebrate species suggests that the promoters may be maintained in orthologous positions. Therefore the identification of the comprehensive orthologous mapping of this type promoter across species can facilitate elucidation of regulatory mechanisms controlling bidirectional gene expression. However, the lack of annotation for many transcribed regions in the genome can impact the orthology designation of these promoters. Human and mouse are among genomes that have been relatively well annotated. Thus we used them as models to study the orthologous patterns of bidirectional promoters.
We developed a method to annotate these regulatory regions by confirming the orthology of the genes found on each side of the promoters. In this manuscript we report the cross-species comparisons between human and mouse genomes, where the bidirectional promoter sets regulating UCSC Known Genes and spliced EST annotations were mapped from human to mouse and vice versa. We validate hundreds of orthologous bidirectional promoters through the presence of orthologous flanking gene annotations in the second species. We also show that regulatory activity of these orthologous promoters confers similar gene expression profiles in 21 tissues of human and mouse. In particular, more than one third of human bidirectional promoters annotated from spliced EST annotations regulate ncRNA, of which over 90% are lncRNAs.
Although evolutionary conservation shows a weaker signature in promoters than coding regions, our technique of mapping of orthologous genes shows that most bidirectional promoter arrangements are conserved across human and mouse genomes, suggesting a critical function. In addition, the similar expression patterns of the orthologous gene sets indicate that the regulatory mechanisms remain largely conserved as well.
所有脊椎动物物种中双向启动子的存在表明这些启动子可能在直系同源位置得以保留。因此,跨物种鉴定此类启动子的全面直系同源图谱有助于阐明控制双向基因表达的调控机制。然而,基因组中许多转录区域缺乏注释会影响这些启动子的直系同源性认定。人类和小鼠的基因组是注释相对完善的基因组。因此,我们以它们为模型来研究双向启动子的直系同源模式。
我们开发了一种方法,通过确认启动子两侧基因的直系同源性来注释这些调控区域。在本论文中,我们报告了人类和小鼠基因组之间的跨物种比较,其中调控UCSC已知基因和剪接EST注释的双向启动子集从人类映射到小鼠,反之亦然。我们通过在第二个物种中存在直系同源侧翼基因注释来验证数百个直系同源双向启动子。我们还表明,这些直系同源启动子的调控活性在人类和小鼠的21种组织中赋予相似的基因表达谱。特别是,从剪接EST注释中注释的人类双向启动子中有超过三分之一调控非编码RNA,其中超过90%是长链非编码RNA。
尽管进化保守性在启动子中比在编码区域表现出较弱的特征,但我们的直系同源基因映射技术表明,大多数双向启动子排列在人类和小鼠基因组中是保守的,这表明其具有关键功能。此外,直系同源基因集的相似表达模式表明调控机制在很大程度上也是保守的。