Gallicchio Emilio, Deng Nanjie, He Peng, Wickstrom Lauren, Perryman Alexander L, Santiago Daniel N, Forli Stefano, Olson Arthur J, Levy Ronald M
Department of Chemistry and Chemical Biology, Rutgers The State University of New Jersey, Piscataway, NJ, 08854, USA,
J Comput Aided Mol Des. 2014 Apr;28(4):475-90. doi: 10.1007/s10822-014-9711-9. Epub 2014 Feb 7.
As part of the SAMPL4 blind challenge, filtered AutoDock Vina ligand docking predictions and large scale binding energy distribution analysis method binding free energy calculations have been applied to the virtual screening of a focused library of candidate binders to the LEDGF site of the HIV integrase protein. The computational protocol leveraged docking and high level atomistic models to improve enrichment. The enrichment factor of our blind predictions ranked best among all of the computational submissions, and second best overall. This work represents to our knowledge the first example of the application of an all-atom physics-based binding free energy model to large scale virtual screening. A total of 285 parallel Hamiltonian replica exchange molecular dynamics absolute protein-ligand binding free energy simulations were conducted starting from docked poses. The setup of the simulations was fully automated, calculations were distributed on multiple computing resources and were completed in a 6-weeks period. The accuracy of the docked poses and the inclusion of intramolecular strain and entropic losses in the binding free energy estimates were the major factors behind the success of the method. Lack of sufficient time and computing resources to investigate additional protonation states of the ligands was a major cause of mispredictions. The experiment demonstrated the applicability of binding free energy modeling to improve hit rates in challenging virtual screening of focused ligand libraries during lead optimization.
作为SAMPL4盲测挑战的一部分,已将经过筛选的AutoDock Vina配体对接预测结果以及大规模结合能分布分析方法的结合自由能计算应用于对HIV整合酶蛋白LEDGF位点候选结合物聚焦文库的虚拟筛选。该计算方案利用对接和高级原子模型来提高富集度。我们的盲测预测富集因子在所有计算提交结果中排名第一,在总体排名中位列第二。据我们所知,这项工作代表了基于全原子物理的结合自由能模型在大规模虚拟筛选中的首次应用实例。从对接构象开始,总共进行了285次并行哈密顿复制交换分子动力学绝对蛋白质-配体结合自由能模拟。模拟设置完全自动化,计算分布在多个计算资源上,并在6周内完成。对接构象的准确性以及结合自由能估计中包含分子内应变和熵损失是该方法成功的主要因素。缺乏足够的时间和计算资源来研究配体的其他质子化状态是预测错误的主要原因。该实验证明了结合自由能建模在先导优化过程中对具有挑战性的聚焦配体文库虚拟筛选中提高命中率的适用性。