Ponce Rita, Hartl Daniel L
Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA.
Gene. 2006 Jul 19;376(2):174-83. doi: 10.1016/j.gene.2006.02.011. Epub 2006 Jun 12.
The origin of new genes and of new functions for existing genes are fundamental processes in molecular evolution. Sdic is a newly evolved gene that arose recently in the D. melanogaster lineage. The gene encodes a novel sperm motility protein. It is a chimeric gene formed by duplication of two other genes followed by multiple deletions and other sequence rearrangements. The Sdic gene exists in several copies in the X chromosome, and is presumed to have undergone several duplications to form a tandemly arrayed gene cluster. Given the very recent origin of the gene and the gene cluster, the analysis of the composition of this gene cluster represents an excellent opportunity to study the origin and evolution of new gene functions and the fate of gene duplications. We have analyzed the nucleotide sequence of this region and reconstructed the evolutionary history of this gene cluster. We found that the cluster is composed by four tandem copies of Sdic; these duplicates are very similar but can be distinguished by the unique pattern of insertions, deletions, and point mutations in each copy. The oldest gene copy in the array has a 3' exon that has undergone accelerated diversification, and also shows divergent regulatory sequences. Moreover, there is evidence that this might be the only gene copy in the tandem array that is transcribed at a significant level, expressing a novel sperm-specific protein. There is also a retrotransposon located at the 3' end of each Sdic gene copy. We argue that this gene cluster was formed in the last two million years by at least three tandem duplications and one retrotransposition event.
新基因的起源以及现有基因新功能的产生是分子进化中的基本过程。Sdic是最近在黑腹果蝇谱系中出现的一个新进化基因。该基因编码一种新型精子运动蛋白。它是一个嵌合基因,由另外两个基因的复制、随后的多次缺失和其他序列重排形成。Sdic基因在X染色体上以多个拷贝存在,推测经历了多次复制形成串联排列的基因簇。鉴于该基因和基因簇起源非常近,对这个基因簇组成的分析代表了一个研究新基因功能的起源与进化以及基因复制命运的绝佳机会。我们分析了该区域的核苷酸序列并重建了这个基因簇的进化历史。我们发现该基因簇由四个串联的Sdic拷贝组成;这些重复序列非常相似,但可以通过每个拷贝中独特的插入、缺失和点突变模式来区分。阵列中最古老的基因拷贝有一个3'外显子,其经历了加速多样化,并且还显示出不同的调控序列。此外,有证据表明这可能是串联阵列中唯一大量转录的基因拷贝,表达一种新型精子特异性蛋白。每个Sdic基因拷贝的3'端还存在一个逆转座子。我们认为这个基因簇是在过去两百万年中由至少三次串联复制和一次逆转座事件形成的。