Klesmith Justin R, Bacik John-Paul, Wrenbeck Emily E, Michalczyk Ryszard, Whitehead Timothy A
Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI 48824.
Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM 87545.
Proc Natl Acad Sci U S A. 2017 Feb 28;114(9):2265-2270. doi: 10.1073/pnas.1614437114. Epub 2017 Feb 14.
Proteins are marginally stable, and an understanding of the sequence determinants for improved protein solubility is highly desired. For enzymes, it is well known that many mutations that increase protein solubility decrease catalytic activity. These competing effects frustrate efforts to design and engineer stable, active enzymes without laborious high-throughput activity screens. To address the trade-off between enzyme solubility and activity, we performed deep mutational scanning using two different screens/selections that purport to gauge protein solubility for two full-length enzymes. We assayed a TEM-1 beta-lactamase variant and levoglucosan kinase (LGK) using yeast surface display (YSD) screening and a twin-arginine translocation pathway selection. We then compared these scans with published experimental fitness landscapes. Results from the YSD screen could explain 37% of the variance in the fitness landscapes for one enzyme. Five percent to 10% of all single missense mutations improve solubility, matching theoretical predictions of global protein stability. For a given solubility-enhancing mutation, the probability that it would retain wild-type fitness was correlated with evolutionary conservation and distance to active site, and anticorrelated with contact number. Hybrid classification models were developed that could predict solubility-enhancing mutations that maintain wild-type fitness with an accuracy of 90%. The downside of using such classification models is the removal of rare mutations that improve both fitness and solubility. To reveal the biophysical basis of enhanced protein solubility and function, we determined the crystallographic structure of one such LGK mutant. Beyond fundamental insights into trade-offs between stability and activity, these results have potential biotechnological applications.
蛋白质的稳定性较差,因此人们非常希望了解提高蛋白质溶解度的序列决定因素。对于酶来说,众所周知,许多增加蛋白质溶解度的突变会降低催化活性。这些相互矛盾的影响阻碍了在不进行费力的高通量活性筛选的情况下设计和改造稳定、有活性的酶的努力。为了解决酶溶解度和活性之间的权衡问题,我们使用了两种不同的筛选/选择方法进行深度突变扫描,这两种方法旨在评估两种全长酶的蛋白质溶解度。我们使用酵母表面展示(YSD)筛选和双精氨酸转运途径选择来检测一种TEM-1β-内酰胺酶变体和左旋葡聚糖激酶(LGK)。然后,我们将这些扫描结果与已发表的实验适应性景观进行了比较。YSD筛选的结果可以解释一种酶适应性景观中37%的变异。所有单错义突变中有5%到10%提高了溶解度,这与全球蛋白质稳定性的理论预测相符。对于给定的溶解度增强突变,其保留野生型适应性的概率与进化保守性和与活性位点的距离相关,与接触数呈反相关。我们开发了混合分类模型,该模型可以预测维持野生型适应性的溶解度增强突变,准确率为90%。使用这种分类模型的缺点是去除了同时提高适应性和溶解度的罕见突变。为了揭示蛋白质溶解度和功能增强的生物物理基础,我们确定了一种这样的LGK突变体的晶体结构。除了对稳定性和活性之间权衡的基本见解之外,这些结果还具有潜在的生物技术应用。