Division of Population Genetics, National Institute of Genetics, Mishima, Japan.
Genome Biol Evol. 2011;3:1119-35. doi: 10.1093/gbe/evr074. Epub 2011 Aug 22.
Duplogs, or intraspecies paralogs, constitute an important portion of eukaryote genomes and serve as a major source of functional innovation. We conducted detailed analyses of recently emerged animal duplogs. Genome data of three vertebrate species (Homo sapiens, Mus musculus, and Danio rerio), Caenorhabditis elegans, and two Drosophila species (Drosophila melanogaster and D. pseudoobscura) were used. Duplication events with a synonymous distance (dS) of up to 0.6 were divided into six age groups according to dS. Duplogs were classified into four equal-sized classes by physical distance and into three classes by relative orientation. We observed the following shared characteristics among intrachromosomal multiexon duplogs: 1) inverted duplogs account for 20-50% of duplogs, and for about half of the physically most distant 25%; 2) except for C. elegans, the composition of physical distances, that of relative orientations, and the proportion of inverted duplogs in each physical distance category are more or less uniform across age groups; 3) except for C. elegans, the characteristics of the youngest (dS < 0.01) duplogs are similar to the overall characteristics of the entire set. These results suggest that intrachromosomal duplogs separated by fairly long physical distances were generated at once, rather than resulting from tandem duplications and subsequent genomic rearrangements. This differs from the three well-known modes of gene duplication: tandem duplication, retrotransposition, and genome duplication. We term this new mode "drift" duplication. Drift duplication has been producing duplicate copies at a pace comparable with that of tandem duplication since the common ancestor of vertebrates, and it may already have operated in the common ancestor of bilaterian animals.
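The classification scheme described in the abstract (six age groups by dS, four equal-sized physical-distance classes, three orientation classes) could be sketched roughly as follows. This is an illustrative sketch only, not the authors' pipeline. The `Gene` record, the function names, the intermediate dS bin edges, and the orientation labels ("direct", "convergent", "divergent") are all assumptions; the abstract states only the youngest cutoff (dS < 0.01), the upper limit (0.6), and the number of classes.

```python
from bisect import bisect_right
from dataclasses import dataclass

# Hypothetical upper bounds of the six age groups. Only 0.01 and 0.6 appear
# in the abstract; the intermediate edges are placeholders.
DS_EDGES = [0.01, 0.05, 0.1, 0.2, 0.4, 0.6]


@dataclass
class Gene:
    """Minimal, assumed representation of one copy on a chromosome."""
    start: int
    end: int
    strand: str  # "+" or "-"


def age_group(ds):
    """Return the age group 0 (youngest) to 5, or None if dS is out of range."""
    if ds < 0 or ds >= DS_EDGES[-1]:
        return None
    return bisect_right(DS_EDGES, ds)


def physical_distance(a, b):
    """Gap in base pairs between two copies on the same chromosome."""
    first, second = sorted((a, b), key=lambda g: g.start)
    return max(0, second.start - first.end)


def orientation(a, b):
    """Classify relative orientation; the three labels are assumed names."""
    first, second = sorted((a, b), key=lambda g: g.start)
    if first.strand == second.strand:
        return "direct"
    # Opposite strands (inverted): either pointing toward or away from each other.
    return "convergent" if first.strand == "+" else "divergent"


def quartile_classes(distances):
    """Assign each distance a class 0..3 by rank so the classes are equal-sized."""
    order = sorted(range(len(distances)), key=lambda i: distances[i])
    classes = [0] * len(distances)
    for rank, i in enumerate(order):
        classes[i] = rank * 4 // len(distances)
    return classes
```

Under these assumptions, an "inverted" duplog pair is one whose orientation is either "convergent" or "divergent", and the physically most distant 25% is class 3 from `quartile_classes`.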