Department of Biochemistry and Institute for Protein Design, University of Washington, Seattle, Washington 98195, United States.
Howard Hughes Medical Institute, University of Washington, Seattle, Washington 98195, United States.
J Chem Theory Comput. 2021 Mar 9;17(3):2000-2010. doi: 10.1021/acs.jctc.0c01184. Epub 2021 Feb 12.
Accurate and rapid calculation of protein-small molecule interaction free energies is critical for computational drug discovery. Because of the large chemical space spanned by drug-like molecules, classical force fields contain thousands of parameters describing atom-pair distance and torsional preferences; each parameter is typically optimized independently on simple representative molecules. Here, we describe a new approach in which small molecule force field parameters are jointly optimized guided by the rich source of information contained within thousands of available small molecule crystal structures. We optimize parameters by requiring that the experimentally determined molecular lattice arrangements have lower energy than all alternative lattice arrangements. Thousands of independent crystal lattice-prediction simulations were run on each of 1386 small molecule crystal structures, and energy function parameters of an implicit solvent energy model were optimized, so native crystal lattice arrangements had the lowest energy. The resulting energy model was implemented in Rosetta, together with a rapid genetic algorithm docking method employing grid-based scoring and receptor flexibility. The success rate of bound structure recapitulation in cross-docking on 1112 complexes was improved by more than 10% over previously published methods, with solutions within <1 Å in over half of the cases. Our results demonstrate that small molecule crystal structures are a rich source of information for guiding molecular force field development, and the improved Rosetta energy function should increase accuracy in a wide range of small molecule structure prediction and design studies.
准确快速地计算蛋白质-小分子相互作用自由能对于计算药物发现至关重要。由于类药性分子所涵盖的化学空间巨大,经典力场包含数千个描述原子对距离和扭转偏好的参数;每个参数通常都是在简单的代表性分子上独立优化的。在这里,我们描述了一种新方法,即通过从数千个可用小分子晶体结构中包含的丰富信息指导,共同优化小分子力场参数。我们通过要求实验确定的分子晶格排列比所有其他晶格排列具有更低的能量来优化参数。对 1386 个小分子晶体结构中的每一个结构都进行了数千个独立的晶体晶格预测模拟,优化了隐式溶剂能量模型的能量函数参数,以使天然晶体晶格排列具有最低能量。所得能量模型与 Rosetta 一起实现,同时还采用了基于网格的打分和受体灵活性的快速遗传算法对接方法。在 1112 个复合物的对接交叉对接中,结合结构再现的成功率比以前发表的方法提高了 10%以上,超过一半的情况的解决方案在 <1 Å 以内。我们的结果表明,小分子晶体结构是指导分子力场发展的丰富信息来源,改进的 Rosetta 能量函数应该会提高广泛的小分子结构预测和设计研究的准确性。