National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA.
Mol Biol Evol. 2010 Mar;27(3):735-41. doi: 10.1093/molbev/msp270. Epub 2009 Nov 12.
The sequences of proteins encoded by a genome evolve at different rates. A correlate of a protein's evolutionary rate is its expression level: highly expressed proteins tend to evolve slowly. Some explanations of rate variation and the correlation between rate and expression predict that more slowly evolving and more highly expressed proteins have more favorable equilibrium constants for folding. Proteins from thermophiles generally have more stable folds than proteins from mesophiles, and it is known that there are systematic differences in amino acid content between thermophilic and mesophilic proteins. I examined whether there are analogous correlations of amino acid frequencies with evolutionary rate and expression level within genomes. In most of the organisms analyzed, there is a striking tendency for more slowly evolving proteins to be more thermophile-like in their amino acid compositions when adjustments are made for variation in GC content. More highly expressed proteins also tend to be more thermophile-like by the same criteria. These results suggest that part of the evolutionary rate variation among proteins is due to variation in the strength of selection for stability of the folded state. They also suggest that increasing strength of this selective force with expression level plays a role in the correlation between evolutionary rate and expression level.
基因组编码的蛋白质序列以不同的速度进化。蛋白质进化速度的一个相关因素是其表达水平:高表达的蛋白质往往进化得较慢。一些关于速率变化和速率与表达之间相关性的解释预测,进化较慢和表达较高的蛋白质具有更有利的折叠平衡常数。与中温生物相比,嗜热生物的蛋白质通常具有更稳定的折叠结构,而且已知嗜热生物和中温生物的蛋白质在氨基酸组成上存在系统差异。我研究了在基因组内,氨基酸频率与进化率和表达水平之间是否存在类似的相关性。在分析的大多数生物体中,当调整 GC 含量的变化时,进化较慢的蛋白质在其氨基酸组成上更倾向于类似嗜热生物,而高表达的蛋白质也倾向于更类似嗜热生物。这些结果表明,蛋白质之间进化率变化的一部分是由于折叠状态稳定性选择强度的变化所致。它们还表明,随着表达水平的增加,这种选择力的强度增加在进化率和表达水平之间的相关性中起着作用。