Bierne Nicolas, Eyre-Walker Adam
Centre for the Study of Evolution and School of Biological Sciences, University of Sussex, Brighton, UK.
Mol Biol Evol. 2004 Jul;21(7):1350-60. doi: 10.1093/molbev/msh134. Epub 2004 Mar 24.
The proportion of amino acid substitutions driven by adaptive evolution can potentially be estimated from polymorphism and divergence data by an extension of the McDonald-Kreitman test. We have developed a maximum-likelihood method to do this and have applied our method to several data sets from three Drosophila species: D. melanogaster, D. simulans, and D. yakuba. The estimated number of adaptive substitutions per codon is not uniformly distributed among genes, but follows a leptokurtic distribution. However, the proportion of amino acid substitutions fixed by adaptive evolution seems to be remarkably constant across the genome (i.e., the proportion of amino acid substitutions that are adaptive appears to be the same in fast-evolving and slow-evolving genes; fast-evolving genes have higher numbers of both adaptive and neutral substitutions). Our estimates do not seem to be significantly biased by selection on synonymous codon use or by the assumption of independence among sites. Nevertheless, an accurate estimate is hampered by the existence of slightly deleterious mutations and variations in effective population size. The analysis of several Drosophila data sets suggests that approximately 25% ± 20% of amino acid substitutions were driven by positive selection in the divergence between D. simulans and D. yakuba.
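As a rough illustration of the quantity being estimated, the sketch below computes the simple moment-based point estimate often associated with the McDonald-Kreitman framework, alpha = 1 - (Ds * Pn) / (Dn * Ps). This is not the maximum-likelihood method described in the abstract, only a simpler stand-in. The function names and the counts in the usage lines are hypothetical and chosen purely for illustration; the estimate assumes synonymous changes are neutral and that nonsynonymous polymorphisms are neutral, which the abstract notes can be violated by slightly deleterious mutations.

```python
def mk_alpha(dn, ds, pn, ps):
    """Moment-based estimate of the proportion of adaptive amino acid substitutions.

    dn, ds: nonsynonymous and synonymous divergence counts (fixed differences)
    pn, ps: nonsynonymous and synonymous polymorphism counts
    Hypothetical helper; assumes synonymous sites evolve neutrally.
    """
    if dn <= 0 or ps <= 0:
        raise ValueError("dn and ps must be positive for the estimate to be defined")
    return 1.0 - (ds * pn) / (dn * ps)


def adaptive_substitutions(dn, ds, pn, ps):
    """Estimated number of adaptive nonsynonymous substitutions (alpha * dn)."""
    return dn * mk_alpha(dn, ds, pn, ps)


# Hypothetical counts, not taken from the paper's data sets.
alpha = mk_alpha(dn=40, ds=100, pn=15, ps=50)
n_adaptive = adaptive_substitutions(dn=40, ds=100, pn=15, ps=50)
print(alpha, n_adaptive)
```

With these made-up counts, the neutral expectation for nonsynonymous divergence is Ds * Pn / Ps = 30, so the remaining 10 of the 40 observed nonsynonymous substitutions would be attributed to adaptive evolution. Negative values of alpha are possible and would typically be read as a signal of segregating slightly deleterious mutations rather than as a meaningful proportion.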