School of Biological Sciences, University of Nebraska, Lincoln, USA.
Genome Biol Evol. 2011;3:588-600. doi: 10.1093/gbe/evr055. Epub 2011 Jun 21.
If rates of postduplication gene retention are positively correlated with levels of functional constraint, then gene duplicates that have been retained in a restricted number of taxonomic lineages would be expected to exhibit relatively low levels of sequence conservation. Paradoxical patterns are presented by gene duplicates that have been retained in a small number of taxa but which are nonetheless subject to strong purifying selection relative to paralogous members of the same multigene family. This pattern suggests that such genes may have been co-opted for novel, lineage-specific functions. One possible example involves the enigmatic globin-E gene (GbE), which appears to be exclusively restricted to birds. Available data indicate that this gene is expressed exclusively in the avian eye, but its physiological function remains a mystery. In contrast to the highly restricted phyletic distribution of GbE, the overwhelming majority of jawed vertebrates (gnathostomes) possess copies of the related cytoglobin (Cygb) and myoglobin (Mb) genes. The purpose of the present study was 1) to assess the phyletic distribution of the Cygb, Mb, and GbE genes among vertebrates, 2) to elucidate the duplicative origins and evolutionary histories of these three genes, and 3) to evaluate the relative levels of functional constraint of these genes based on comparative sequence analysis. To accomplish these objectives, we conducted a combined phylogenetic and comparative genomic analysis involving taxa that represent each of the major lineages of gnathostome vertebrates. Results of synteny comparisons and phylogenetic topology tests revealed that GbE is clearly not the product of a recent, bird-specific duplication event. Instead, GbE originated via duplication of a proto-Mb gene in the stem lineage of gnathostomes. Unlike the Mb gene, which has been retained in all major gnathostome lineages other than amphibians, the GbE gene has been retained only in the lineage leading to modern birds and has been independently lost in at least four major lineages: teleost fish, amphibians, mammals, and nonavian reptiles. Despite the restricted phyletic distribution of this gene, our results indicate that GbE is one of the most highly conserved globins in the avian genome.
如果基因复制后保留的速率与功能约束的水平呈正相关,那么在少数分类群中保留下来的基因重复序列应该表现出相对较低的序列保守性。在少数分类群中保留下来但相对于同一多基因家族的同源基因仍受到强烈纯化选择的基因重复序列呈现出矛盾的模式。这种模式表明,这些基因可能被用于新的、谱系特异性的功能。一个可能的例子涉及神秘的球蛋白-E 基因 (GbE),它似乎只限于鸟类。现有数据表明,该基因仅在鸟类眼睛中表达,但它的生理功能仍然是个谜。与 GbE 的高度限制的系统发生分布形成对比的是,绝大多数有颌脊椎动物(颌口类)都有相关的细胞球蛋白 (Cygb) 和肌红蛋白 (Mb) 基因的副本。本研究的目的是 1)评估 Cygb、Mb 和 GbE 基因在脊椎动物中的系统发生分布,2)阐明这三个基因的复制起源和进化历史,以及 3)基于比较序列分析评估这些基因的功能约束的相对水平。为了实现这些目标,我们进行了一项综合的系统发育和比较基因组分析,涉及代表颌口类脊椎动物主要谱系的分类群。同线性比较和系统发育拓扑测试的结果表明,GbE 显然不是最近鸟类特有的复制事件的产物。相反,GbE 起源于颌口类脊椎动物的祖系中 Mb 基因的复制。与 Mb 基因不同,它在除了两栖类之外的所有主要颌口类谱系中都被保留下来,而 GbE 基因仅在导致现代鸟类的谱系中被保留下来,并在至少四个主要谱系中独立丢失:硬骨鱼、两栖动物、哺乳动物和非鸟类爬行动物。尽管该基因的系统发生分布受到限制,但我们的结果表明,GbE 是鸟类基因组中最保守的球蛋白之一。