Institute of Cytology, Russian Academy of Sciences, St. Petersburg 194064, Russia.
J Mol Evol. 2012 Aug;75(1-2):25-33. doi: 10.1007/s00239-012-9516-1. Epub 2012 Aug 25.
Proteome complexity increases during evolution mostly through gene duplication followed by divergence. In this genome-scale study of the human genome, I show that the density distribution of duplicate gene pairs along the axis of protein divergence between pair members forms two main peaks, with a small peak and a plateau preceding the first main peak. This picture indicates the existence of three evolutionary stages of duplicate gene evolution. The analysis of various functional parameters (gene expression level and breadth, transcription factor targets, protein interaction networks) suggests that subfunctionalization (partitioning of function) is the predominant mode of divergence in the first main peak, whereas neofunctionalization (acquisition of novel functions) prevails in the second main peak. Young duplicate pairs show a much higher expression level than singleton genes and more diverged duplicates, which indicates that a requirement for high gene dosage is important for the retention of duplicates just after the duplication event. Thus, the prevailing route of duplicate evolution seems to be high gene dosage, followed by subfunctionalization, followed by neofunctionalization. This adaptationist model suggests that an organism evolves in the direction of its most intensively used functions.
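The kind of analysis described in the abstract, a density distribution of duplicate pairs along a divergence axis followed by a search for peaks, can be illustrated with a minimal sketch. The abstract does not state how divergence was binned or how peaks were identified, so everything below is an assumption for illustration only: the function names `density_histogram` and `local_peaks` are hypothetical, divergence is assumed to be scaled to the range 0 to 1, the bin count is arbitrary, and the toy counts are invented to mimic the described shape (a small peak, a plateau, then two main peaks). They are not data from the study.

```python
from typing import List, Sequence


def density_histogram(values: Sequence[float], n_bins: int = 10,
                      lo: float = 0.0, hi: float = 1.0) -> List[float]:
    """Return the fraction of values falling in each of n_bins equal-width bins on [lo, hi]."""
    counts = [0] * n_bins
    width = (hi - lo) / n_bins
    for v in values:
        if not lo <= v <= hi:
            continue  # ignore values outside the assumed divergence range
        # The top edge (v == hi) is folded into the last bin.
        idx = min(int((v - lo) / width), n_bins - 1)
        counts[idx] += 1
    total = sum(counts)
    return [c / total if total else 0.0 for c in counts]


def local_peaks(density: Sequence[float]) -> List[int]:
    """Return indices of bins strictly higher than both neighbours.

    Edge bins are compared against their single neighbour only.
    Flat runs (plateaus) are deliberately not reported as peaks.
    """
    peaks = []
    for i, d in enumerate(density):
        left = density[i - 1] if i > 0 else float("-inf")
        right = density[i + 1] if i < len(density) - 1 else float("-inf")
        if d > left and d > right:
            peaks.append(i)
    return peaks


# Invented per-bin pair counts (NOT from the paper), shaped like the
# described distribution: small peak, plateau, first and second main peaks.
toy_counts = [1, 3, 2, 2, 6, 3, 1, 5, 2, 0]
# Place each toy pair at the centre of its bin on a 0..1 divergence axis.
toy_divergence = [(i + 0.5) / 10 for i, c in enumerate(toy_counts) for _ in range(c)]

toy_density = density_histogram(toy_divergence, n_bins=10)
toy_peaks = local_peaks(toy_density)
print(toy_peaks)
```

With real data, a simple neighbour comparison like this would be sensitive to bin width and noise; a smoothed density estimate with a prominence threshold would likely be a more robust choice.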