Department of Chemistry, Shanghai Key Laboratory of Molecular Catalysis and Innovative Materials, Fudan University, Shanghai, 200433, China.
J Comput Chem. 2013 Jul 15;34(19):1636-46. doi: 10.1002/jcc.23303. Epub 2013 May 16.
Targeted therapy is currently a hot topic in the fields of cancer research and drug design. An important requirement for this approach is the development of potent and selective inhibitors for the identified target protein. However, current ways to estimate inhibitor efficacy rely on empirical protein-ligand interaction scoring functions which, suffering from their heavy parameterizations, often lead to a low accuracy. In this work, we develop a nonfitting scoring function, which consists of three terms: (1) gas-phase protein-ligand binding enthalpy obtained by the eXtended ONIOM hybrid method based on an integration of density functional theory (DFT) methods (XYG3 and ωB97X-D) and the semiempirical PM6 method, (2) solvation free energy based on DFT-SMD solvation model, and (3) entropy effect estimated by using DFT frequency analysis. The new scoring function is tested on a cyclin-dependent kinase 2 (CDK2) inhibitor database including 76 CDK2 protein inhibitors and a p21-activated kinase 1 (PAK1) inhibitor database including 20 organometallic PAK1 protein inhibitors. From the results, good correlations are found between the calculated scores and the experimental inhibitor efficacies with the square of correlation coefficient R(2) of 0.76-0.88. This suggests a good predictive power of this scoring function. To the best of our knowledge, this is the first high level theory-based nonfitting scoring function with such a good level of performance. This scoring function is recommended to be used in the final screening of lead structure derivatives.
靶向治疗目前是癌症研究和药物设计领域的热门话题。这种方法的一个重要要求是开发针对已确定的靶蛋白的有效且选择性的抑制剂。然而,目前估计抑制剂功效的方法依赖于经验性的蛋白质-配体相互作用评分函数,这些函数由于其大量的参数化,往往导致准确性较低。在这项工作中,我们开发了一种非拟合评分函数,它由三个部分组成:(1)基于密度泛函理论(DFT)方法(XYG3 和 ωB97X-D)和半经验 PM6 方法的扩展 ONIOM 混合方法获得的气相蛋白-配体结合焓,(2)基于 DFT-SMD 溶剂化模型的溶剂化自由能,以及(3)使用 DFT 频率分析估计的熵效应。新的评分函数在包括 76 种 CDK2 蛋白抑制剂的细胞周期蛋白依赖性激酶 2(CDK2)抑制剂数据库和包括 20 种有机金属 PAK1 蛋白抑制剂的 p21 激活激酶 1(PAK1)抑制剂数据库上进行了测试。从结果中可以发现,计算得分与实验抑制剂功效之间存在良好的相关性,相关系数 R²为 0.76-0.88。这表明该评分函数具有良好的预测能力。据我们所知,这是第一个具有如此良好性能的基于高水平理论的非拟合评分函数。建议将该评分函数用于先导结构衍生物的最终筛选。