Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, 368 Plantation St, Worcester, MA.
Mol Biol Evol. 2018 Jan 1;35(1):211-224. doi: 10.1093/molbev/msx282.
Prokaryotes evolved to thrive in an extremely diverse set of habitats, and their proteomes bear signatures of environmental conditions. Although correlations between amino acid usage and environmental temperature are well-documented, understanding of the mechanisms of thermal adaptation remains incomplete. Here, we couple the energetic costs of protein folding and protein homeostasis to build a microscopic model explaining both the overall amino acid composition and its temperature trends. Low biosynthesis costs lead to low diversity of physical interactions between amino acid residues, which in turn makes proteins less stable and drives up chaperone activity to maintain appropriate levels of folded, functional proteins. Assuming that the cost of chaperone activity is proportional to the fraction of unfolded client proteins, we simulated thermal adaptation of model proteins subject to minimization of the total cost of amino acid synthesis and chaperone activity. For the first time, we predicted both the proteome-average amino acid abundances and their temperature trends simultaneously, and found strong correlations between model predictions and 402 genomes of bacteria and archaea. The energetic constraint on protein evolution is more apparent in highly expressed proteins, selected by codon adaptation index. We found that in bacteria, highly expressed proteins are similar in composition to thermophilic ones, whereas in archaea no correlation between predicted expression level and thermostability was observed. At the same time, thermal adaptations of highly expressed proteins in bacteria and archaea are nearly identical, suggesting that universal energetic constraints prevail over the phylogenetic differences between these domains of life.
原核生物进化到能够在极其多样化的生境中茁壮成长,其蛋白质组具有环境条件的特征。虽然氨基酸使用与环境温度之间的相关性已有充分的记录,但对热适应的机制仍不完全了解。在这里,我们将蛋白质折叠和蛋白质平衡的能量成本联系起来,构建了一个微观模型,解释了整体氨基酸组成及其温度趋势。低生物合成成本导致氨基酸残基之间的物理相互作用多样性较低,这反过来又使蛋白质不太稳定,并增加了伴侣蛋白的活性以维持适当水平的折叠、功能蛋白。假设伴侣蛋白活性的成本与未折叠客户蛋白的分数成正比,我们模拟了在最小化氨基酸合成和伴侣蛋白活性总成本的条件下,模型蛋白的热适应。我们首次同时预测了蛋白质组平均氨基酸丰度及其温度趋势,并发现模型预测与 402 个细菌和古菌基因组之间存在很强的相关性。蛋白质进化的能量限制在高表达蛋白中更为明显,这是由密码子适应指数选择的。我们发现,在细菌中,高表达蛋白的组成与嗜热蛋白相似,而在古菌中,预测的表达水平与热稳定性之间没有相关性。与此同时,细菌和古菌中高表达蛋白的热适应几乎相同,这表明普遍的能量限制超过了这些生命领域之间的系统发育差异。