Department of Chemistry, Brown University, Providence, Rhode Island, United States of America.
Department of Ecology and Evolutionary Biology, Brown University, Providence, Rhode Island, United States of America.
PLoS One. 2020 May 29;15(5):e0233509. doi: 10.1371/journal.pone.0233509. eCollection 2020.
One of the long-standing holy grails of molecular evolution has been the ability to predict an organism's fitness directly from its genotype. With such predictive abilities in hand, researchers would be able to more accurately forecast how organisms will evolve and how proteins with novel functions could be engineered, leading to revolutionary advances in medicine and biotechnology. In this work, we assemble the largest reported set of experimental TEM-1 β-lactamase folding free energies and use this data in conjunction with previously acquired fitness data and computational free energy predictions to determine how much of the fitness of β-lactamase can be directly predicted by thermodynamic folding and binding free energies. We focus upon β-lactamase because of its long history as a model enzyme and its central role in antibiotic resistance. Based upon a set of 21 β-lactamase single and double mutants expressly designed to influence protein folding, we first demonstrate that modeling software designed to compute folding free energies such as FoldX and PyRosetta can meaningfully, although not perfectly, predict the experimental folding free energies of single mutants. Interestingly, while these techniques also yield sensible double mutant free energies, we show that they do so for the wrong physical reasons. We then go on to assess how well both experimental and computational folding free energies explain single mutant fitness. We find that folding free energies account for, at most, 24% of the variance in β-lactamase fitness values according to linear models and, somewhat surprisingly, complementing folding free energies with computationally-predicted binding free energies of residues near the active site only increases the folding-only figure by a few percent. This strongly suggests that the majority of β-lactamase's fitness is controlled by factors other than free energies. Overall, our results shed a bright light on to what extent the community is justified in using thermodynamic measures to infer protein fitness as well as how applicable modern computational techniques for predicting free energies will be to the large data sets of multiply-mutated proteins forthcoming.
分子进化中长期以来的圣杯之一是能够直接从基因型预测生物体的适应度。有了这种预测能力,研究人员将能够更准确地预测生物体如何进化以及如何设计具有新功能的蛋白质,从而在医学和生物技术领域取得革命性的进展。在这项工作中,我们汇集了报道的最大的实验 TEM-1 β-内酰胺酶折叠自由能数据集,并结合以前获得的适应度数据和计算自由能预测,来确定β-内酰胺酶的适应度有多少可以直接通过热力学折叠和结合自由能来预测。我们专注于β-内酰胺酶,因为它作为一种模型酶具有悠久的历史,并且在抗生素耐药性方面发挥着核心作用。基于一组专门设计用于影响蛋白质折叠的 21 个β-内酰胺酶单突变体和双突变体,我们首先证明旨在计算折叠自由能的建模软件,例如 FoldX 和 PyRosetta,可以有意义地(尽管不是完美地)预测单突变体的实验折叠自由能。有趣的是,尽管这些技术也能得出合理的双突变体自由能,但我们表明它们这样做的物理原因是错误的。然后,我们继续评估实验和计算折叠自由能在多大程度上解释单突变体的适应度。我们发现,根据线性模型,折叠自由能最多可以解释β-内酰胺酶适应度值的 24%,而且令人惊讶的是,在用计算预测的活性位点附近残基的结合自由能补充折叠自由能时,仅将折叠自由能的数值提高了几个百分点。这强烈表明,β-内酰胺酶的大部分适应度是由自由能以外的因素控制的。总体而言,我们的结果阐明了在多大程度上可以使用热力学测量来推断蛋白质适应度,以及现代预测自由能的计算技术在多大程度上适用于即将出现的大量突变蛋白质的大数据集。