Kondrashov Alexey S, Sunyaev Shamil, Kondrashov Fyodor A
National Center for Biotechnology Information, National Institutes of Health, Bethesda, MD 20894, USA.
Proc Natl Acad Sci U S A. 2002 Nov 12;99(23):14878-83. doi: 10.1073/pnas.232565499. Epub 2002 Oct 28.
We study fitness landscape in the space of protein sequences by relating sets of human pathogenic missense mutations in 32 proteins to amino acid substitutions that occurred in the course of evolution of these proteins. On average, approximately 10% of deviations of a nonhuman protein from its human ortholog are compensated pathogenic deviations (CPDs), i.e., are caused by an amino acid substitution that, at this site, would be pathogenic to humans. Normal functioning of a CPD-containing protein must be caused by other, compensatory deviations of the nonhuman species from humans. Together, a CPD and the corresponding compensatory deviation form a Dobzhansky-Muller incompatibility that can be visualized as the corner on a fitness ridge. Thus, proteins evolve along fitness ridges which contain only approximately 10 steps between successive corners. The fraction of CPDs among all deviations of a protein from its human ortholog does not increase with the evolutionary distance between the proteins, indicating that substitutions that carry evolving proteins around these corners occur in rapid succession, driven by positive selection. Data on fitness of interspecies hybrids suggest that the compensatory change that makes a CPD fit usually occurs within the same protein. Data on protein structures and on cooccurrence of amino acids at different sites of multiple orthologous proteins often make it possible to provisionally identify the substitution that compensates a particular CPD.
我们通过将32种蛋白质中的人类致病性错义突变集与这些蛋白质进化过程中发生的氨基酸替换相关联,来研究蛋白质序列空间中的适应度景观。平均而言,非人类蛋白质与其人类直系同源蛋白之间约10%的偏差是补偿性致病偏差(CPD),即由在该位点对人类致病的氨基酸替换引起。含有CPD的蛋白质的正常功能必定由非人类物种与人类之间的其他补偿性偏差导致。CPD与相应的补偿性偏差共同构成了一种多布赞斯基-穆勒不相容性,可将其视为适应度脊上的拐角。因此,蛋白质沿着适应度脊进化,连续拐角之间仅包含约10个步骤。蛋白质与其人类直系同源蛋白之间所有偏差中CPD的比例不会随着蛋白质之间的进化距离增加,这表明围绕这些拐角携带进化中蛋白质的替换是在正选择驱动下快速连续发生的。种间杂种适应度的数据表明,使CPD变得适合的补偿性变化通常发生在同一蛋白质内。关于蛋白质结构以及多个直系同源蛋白质不同位点氨基酸共现的数据常常使得临时确定补偿特定CPD的替换成为可能。