Kim Seong-Ho, Yi Soojin V
School of Biology, Georgia Institute of Technology, GA, USA.
Mol Biol Evol. 2008 Apr;25(4):634-42. doi: 10.1093/molbev/msm281. Epub 2007 Dec 19.
It is often stated that patterns of nonsynonymous rate variation among mammalian lineages are more irregular than expected or overdispersed under the neutral model, whereas synonymous sites conform to the neutral model. Here we reexamined genome-wide patterns of the variance to mean ratio, or index of dispersion (R), of substitutions in proteins from human, mouse, and dog. Contrary to the prevailing notion, we found that the mean index of dispersion for nonsynonymous sites of mammalian proteins is not significantly different from 1. We propose that earlier analyses were biased because the data included disproportionately more protein hormones, which tend to be more dispersed than genes in other functional categories. Synonymous sites exhibit greater degree of dispersion than nonsynonymous sites, although similar to earlier estimates and potentially due to errors associated with correction for multiple hits. Overall, our analysis identifies strong genome-wide generation-time effect and natural selection as important determinants of among-lineage variation of protein evolutionary rates. Furthermore, patterns of lineage-specific selective constraint are consistent with the nearly neutral model of molecular evolution.
人们常说,在中性模型下,哺乳动物谱系间非同义突变率的变化模式比预期的更不规则或过度离散,而同义位点则符合中性模型。在此,我们重新审视了人类、小鼠和狗蛋白质中替换的方差均值比或离散指数(R)的全基因组模式。与普遍观点相反,我们发现哺乳动物蛋白质非同义位点的平均离散指数与1没有显著差异。我们认为,早期的分析存在偏差,因为数据中蛋白质激素的比例过高,而蛋白质激素往往比其他功能类别的基因更离散。同义位点比非同义位点表现出更大程度的离散,尽管与早期估计相似,这可能是由于与多重打击校正相关的误差。总体而言,我们的分析确定了全基因组范围内强大的世代时间效应和自然选择是蛋白质进化速率谱系间变异的重要决定因素。此外,谱系特异性选择约束模式与分子进化的近中性模型一致。