Winter Eitan E, Ponting Chris P
Department of Human Anatomy and Genetics, University of Oxford, Oxford OX1 3QX, UK.
BMC Evol Biol. 2005 Oct 12;5:54. doi: 10.1186/1471-2148-5-54.
The identification of sequence innovations in the genomes of mammals facilitates understanding of human gene function, as well as sheds light on the molecular mechanisms which underlie these changes. Although gene duplication plays a major role in genome evolution, studies regarding concerted evolution events among gene family members have been limited in scope and restricted to protein-coding regions, where high sequence similarity is easily detectable.
We describe a mammalian-specific expansion of more than 20 rapidly-evolving genes on human chromosome Xq22.1. Many of these are highly divergent in their protein-coding regions yet contain a conserved sequence motif in their 5' UTRs which appears to have been maintained by multiple events of concerted evolution. These events have led to the generation of chimaeric genes, each with a 5' UTR and a protein-coding region that possess independent evolutionary histories. We suggest that concerted evolution has occurred via gene conversion independently in different mammalian lineages, and these events have resulted in elevated G+C levels in the encompassing genomic regions. These concerted evolution events occurred within and between genes from three separate protein families ('brain-expressed X-linked' [BEX], WWbp5-like X-linked [WEX] and G-protein-coupled receptor-associated sorting protein [GASP]), which often are expressed in mammalian brains and associated with receptor mediated signalling and apoptosis.
Despite high protein-coding divergence among mammalian-specific genes, we identified a DNA motif common to these genes' 5' UTR exons. The motif has undergone concerted evolution events independently of its neighbouring protein-coding regions, leading to formation of evolutionary chimaeric genes. These findings have implications for the identification of non protein-coding regulatory elements and their lineage-specific evolution in mammals.
鉴定哺乳动物基因组中的序列创新有助于理解人类基因功能,也能揭示这些变化背后的分子机制。尽管基因复制在基因组进化中起主要作用,但关于基因家族成员间协同进化事件的研究范围有限,且局限于蛋白质编码区域,因为在该区域很容易检测到高序列相似性。
我们描述了人类X染色体q22.1上20多个快速进化基因的哺乳动物特异性扩增。其中许多基因在蛋白质编码区域高度分化,但在其5'非翻译区含有一个保守的序列基序,该基序似乎通过多次协同进化事件得以保留。这些事件导致了嵌合基因的产生,每个嵌合基因都有一个5'非翻译区和一个具有独立进化历史的蛋白质编码区域。我们认为,协同进化在不同的哺乳动物谱系中通过基因转换独立发生,这些事件导致了周围基因组区域中G+C水平的升高。这些协同进化事件发生在三个独立蛋白质家族(“脑表达X连锁”[BEX]、WWbp5样X连锁[WEX]和G蛋白偶联受体相关分选蛋白[GASP])的基因内部和之间,这些家族通常在哺乳动物大脑中表达,并与受体介导的信号传导和细胞凋亡相关。
尽管哺乳动物特异性基因在蛋白质编码方面存在高度差异,但我们在这些基因的5'非翻译区外显子中鉴定出了一个共同的DNA基序。该基序独立于其相邻的蛋白质编码区域经历了协同进化事件,导致了进化嵌合基因的形成。这些发现对于鉴定非蛋白质编码调控元件及其在哺乳动物中的谱系特异性进化具有重要意义。