Carlin Dylan Alexander, Caster Ryan W, Wang Xiaokang, Betzenderfer Stephanie A, Chen Claire X, Duong Veasna M, Ryklansky Carolina V, Alpekin Alp, Beaumont Nathan, Kapoor Harshul, Kim Nicole, Mohabbot Hosna, Pang Boyu, Teel Rachel, Whithaus Lillian, Tagkopoulos Ilias, Siegel Justin B
Biophysics Graduate Group, University of California Davis, California, United States of America.
Genome Center, University of California Davis, Davis, California, United States of America.
PLoS One. 2016 Jan 27;11(1):e0147596. doi: 10.1371/journal.pone.0147596. eCollection 2016.
The use of computational modeling algorithms to guide the design of novel enzyme catalysts is a rapidly growing field. Force-field based methods have now been used to engineer both enzyme specificity and activity. However, the proportion of designed mutants with the intended function is often less than ten percent. One potential reason for this is that current force-field based approaches are trained on indirect measures of function rather than direct correlation to experimentally-determined functional effects of mutations. We hypothesize that this is partially due to the lack of data sets for which a large panel of enzyme variants has been produced, purified, and kinetically characterized. Here we report the kcat and KM values of 100 purified mutants of a glycoside hydrolase enzyme. We demonstrate the utility of this data set by using machine learning to train a new algorithm that enables prediction of each kinetic parameter based on readily-modeled structural features. The generated dataset and analyses carried out in this study not only provide insight into how this enzyme functions, they also provide a clear path forward for the improvement of computational enzyme redesign algorithms.
使用计算建模算法来指导新型酶催化剂的设计是一个快速发展的领域。基于力场的方法现已用于改造酶的特异性和活性。然而,具有预期功能的设计突变体的比例通常不到10%。造成这种情况的一个潜在原因是,当前基于力场的方法是根据功能的间接测量进行训练的,而不是与突变的实验确定的功能效应直接相关。我们推测,这部分是由于缺乏已产生、纯化并进行动力学表征的大量酶变体数据集。在此,我们报告了一种糖苷水解酶的100个纯化突变体的催化常数(kcat)和米氏常数(KM)值。我们通过使用机器学习来训练一种新算法,证明了该数据集的实用性,该算法能够根据易于建模的结构特征预测每个动力学参数。本研究中生成的数据集和进行的分析不仅深入了解了这种酶的功能,还为改进计算酶重新设计算法提供了一条清晰的前进道路。