Zou Zhengting, Zhang Jianzhi
Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor.
Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor
Mol Biol Evol. 2015 Aug;32(8):2085-96. doi: 10.1093/molbev/msv091. Epub 2015 Apr 9.
Convergent and parallel amino acid substitutions in protein evolution, collectively referred to as molecular convergence here, have small probabilities under neutral evolution. For this reason, molecular convergence is commonly viewed as evidence for similar adaptations of different species. The surge in the number of reports of molecular convergence in the last decade raises the intriguing question of whether molecular convergence occurs substantially more frequently than expected under neutral evolution. We here address this question using all one-to-one orthologous proteins encoded by the genomes of 12 fruit fly species and those encoded by 17 mammals. We found that the expected amount of molecular convergence varies greatly depending on the specific neutral substitution model assumed at each amino acid site and that the observed amount of molecular convergence is explainable by neutral models incorporating site-specific information of acceptable amino acids. Interestingly, the total number of convergent and parallel substitutions between two lineages, relative to the neutral expectation, decreases with the genetic distance between the two lineages, regardless of the model used in computing the neutral expectation. We hypothesize that this trend results from differences in the amino acids acceptable at a given site among different clades of a phylogeny, due to prevalent epistasis, and provide simulation as well as empirical evidence for this hypothesis. Together, our study finds no genomic evidence for higher-than-neutral levels of molecular convergence, but suggests the presence of abundant epistasis that decreases the likelihood of molecular convergence between distantly related lineages.
蛋白质进化中的趋同和平行氨基酸替换,在这里统称为分子趋同,在中性进化下发生的概率很小。因此,分子趋同通常被视为不同物种具有相似适应性的证据。在过去十年中,分子趋同报告数量的激增引发了一个有趣的问题:分子趋同是否比中性进化预期的更频繁地发生。我们在这里使用12种果蝇物种基因组编码的所有一对一直系同源蛋白以及17种哺乳动物编码的蛋白来解决这个问题。我们发现,预期的分子趋同数量因每个氨基酸位点假设的特定中性替换模型而异,并且观察到的分子趋同数量可以通过纳入可接受氨基酸位点特异性信息的中性模型来解释。有趣的是,两个谱系之间趋同和平行替换的总数相对于中性预期而言,会随着两个谱系之间的遗传距离而减少,无论在计算中性预期时使用何种模型。我们假设这种趋势是由于系统发育中不同分支在给定位点可接受的氨基酸存在差异,这是由于普遍存在的上位性导致的,并为这一假设提供了模拟和实证证据。总之,我们的研究没有发现基因组证据表明分子趋同水平高于中性水平,但表明存在大量上位性,这降低了远缘谱系之间分子趋同的可能性。