Hanada Kousuke, Shiu Shin-Han, Li Wen-Hsiung
Department of Ecology and Evolution, University of Chicago, USA.
Mol Biol Evol. 2007 Oct;24(10):2235-41. doi: 10.1093/molbev/msm152. Epub 2007 Jul 25.
There are 2 ways to infer selection pressures in the evolution of protein-coding genes, the nonsynonymous and synonymous substitution rate ratio (K(A)/K(S)) and the radical and conservative amino acid replacement rate ratio (K(R)/K(C)). Because the K(R)/K(C) ratio depends on the definition of radical and conservative changes in the classification of amino acids, we develop an amino acid classification that maximizes the correlation between K(A)/K(S) and K(R)/K(C). An analysis of 3,375 orthologous gene groups among 5 mammalian species shows that our classification gives a significantly higher correlation coefficient between the 2 ratios than those of existing classifications. However, there are many orthologous gene groups with a low K(A)/K(S) but a high K(R)/K(C) ratio. Examining the functions of these genes, we found an overrepresentation of functional categories related to development. To determine if the overrepresentation is stage specific, we examined the expression patterns of these genes at different developmental stages of the mouse. Interestingly, these genes are highly expressed in the early middle stage of development (blastocyst to amnion). It is commonly thought that developmental genes tend to be conservative in evolution, but some molecular changes in developmental stages should have contributed to morphological divergence in adult mammals. Therefore, we propose that the relaxed pressures indicated by the K(R)/K(C) ratio but not by K(A)/K(S) in the early middle stage of development may be important for the morphological divergence of mammals at the adult stage, whereas purifying selection detected by K(A)/K(S) occurs in the early middle developmental stage.
推断蛋白质编码基因进化过程中选择压力的方法有两种,即非同义替换率与同义替换率之比(K(A)/K(S))以及激进氨基酸替换率与保守氨基酸替换率之比(K(R)/K(C))。由于K(R)/K(C)比值取决于氨基酸分类中激进和保守变化的定义,我们开发了一种氨基酸分类方法,以使K(A)/K(S)和K(R)/K(C)之间的相关性最大化。对5种哺乳动物的3375个直系同源基因组进行分析表明,与现有分类方法相比,我们的分类方法在这两个比值之间给出了显著更高的相关系数。然而,存在许多直系同源基因组,其K(A)/K(S)较低但K(R)/K(C)比值较高。通过研究这些基因的功能,我们发现与发育相关的功能类别出现了过度富集。为了确定这种过度富集是否具有阶段特异性,我们检查了这些基因在小鼠不同发育阶段的表达模式。有趣的是,这些基因在发育的早期中期(囊胚到羊膜)高度表达。人们通常认为发育基因在进化中倾向于保守,但发育阶段的一些分子变化应该对成年哺乳动物的形态差异有所贡献。因此,我们提出,发育早期中期由K(R)/K(C)比值而非K(A)/K(S)所表明的放松选择压力可能对成年阶段哺乳动物的形态差异很重要,而K(A)/K(S)检测到的纯化选择则发生在发育的早期中期阶段。