Mahony Shaun, Corcoran David L, Feingold Eleanor, Benos Panayiotis V
Department of Computational Biology, School of Medicine, University of Pittsburgh, Fifth Avenue, Pittsburgh, PA 15260, USA.
Genome Biol. 2007;8(5):R84. doi: 10.1186/gb-2007-8-5-r84.
Being the first noneutherian mammal sequenced, Monodelphis domestica (opossum) offers great potential for enhancing our understanding of the evolutionary processes that take place in mammals. This study focuses on the evolutionary relationships between conservation of noncoding sequences, cis-regulatory elements, and biologic functions of regulated genes in opossum and eight vertebrate species.
Analysis of 145 intergenic microRNA and all protein coding genes revealed that the upstream sequences of the former are up to twice as conserved as the latter among mammals, except in the first 500 base pairs, where the conservation is similar. Comparison of promoter conservation in 513 protein coding genes and related transcription factor binding sites (TFBSs) showed that 41% of the known human TFBSs are located in the 6.7% of promoter regions that are conserved between human and opossum. Some core biologic processes exhibited significantly fewer conserved TFBSs in human-opossum comparisons, suggesting greater functional divergence. A new measure of efficiency in multigenome phylogenetic footprinting (base regulatory potential rate [BRPR]) shows that including human-opossum conservation increases specificity in finding human TFBSs.
Opossum facilitates better estimation of promoter conservation and TFBS turnover among mammals. The fact that substantial TFBS numbers are located in a small proportion of the human-opossum conserved sequences emphasizes the importance of marsupial genomes for phylogenetic footprinting-based motif discovery strategies. The BRPR measure is expected to help select genome combinations for optimal performance of these algorithms. Finally, although the etiology of the microRNA upstream increased conservation remains unknown, it is expected to have strong implications for our understanding of regulation of their expression.
作为首个被测序的非真兽类哺乳动物,家短尾负鼠为增进我们对哺乳动物进化过程的理解提供了巨大潜力。本研究聚焦于负鼠与其他八个脊椎动物物种中非编码序列的保守性、顺式调控元件以及受调控基因的生物学功能之间的进化关系。
对145个基因间微小RNA和所有蛋白质编码基因的分析表明,在哺乳动物中,前者的上游序列保守程度是后者的两倍,不过在前500个碱基对中,两者的保守程度相似。对513个蛋白质编码基因的启动子保守性及相关转录因子结合位点(TFBSs)的比较显示,41%的已知人类TFBSs位于人类和负鼠之间保守的6.7%的启动子区域。在人类与负鼠的比较中,一些核心生物学过程的保守TFBSs显著减少,这表明功能差异更大。一种多基因组系统发育足迹分析中的新效率衡量指标(碱基调控潜能率[BRPR])表明,纳入人类与负鼠的保守性可提高发现人类TFBSs的特异性。
负鼠有助于更好地估计哺乳动物之间启动子的保守性和TFBS的周转率。大量TFBSs位于人类与负鼠保守序列的一小部分这一事实,强调了有袋类基因组对于基于系统发育足迹的基序发现策略的重要性。BRPR指标有望帮助选择基因组组合,以实现这些算法的最佳性能。最后,尽管微小RNA上游保守性增加的病因尚不清楚,但预计这将对我们理解其表达调控有重要意义。