Applied Science, BioPharmics LLC, Santa Rosa, California 95404, United States.
Computer-Assisted Drug-Design, Bristol-Myers Squibb Company, Princeton, New Jersey 08648, United States.
J Chem Inf Model. 2021 Dec 27;61(12):5948-5966. doi: 10.1021/acs.jcim.1c01382. Epub 2021 Dec 10.
We present results on the extent to which physics-based simulation (exemplified by FEP) and focused machine learning (exemplified by QuanSA) are complementary for ligand affinity prediction. For both methods, predictions of activity for LFA-1 inhibitors from a medicinal chemistry lead optimization project were accurate within the applicable domain of each approach. A hybrid model that combined predictions by both approaches by simple averaging performed better than either method, with respect to both ranking and absolute p values. Two publicly available FEP benchmarks, covering 16 diverse biological targets, were used to test the generality of the synergy. By identifying training data specifically focused on relevant ligands, accurate QuanSA models were derived using ligand activity data known at the time of the original series publications. Results across the 16 benchmark targets demonstrated significant improvements both for ranking and for absolute p values using hybrid predictions that combined the FEP and QuanSA predicted affinity values. The results argue for a combined approach for affinity prediction that makes use of physics-driven methods as well as those driven by machine learning, each applied carefully on appropriate compounds, with hybrid prediction strategies being employed where possible.
我们展示了基于物理的模拟(以 FEP 为代表)和集中机器学习(以 QuanSA 为代表)在配体亲和力预测方面互补的程度。对于这两种方法,来自药物化学先导优化项目的 LFA-1 抑制剂的活性预测在每种方法的适用范围内都是准确的。通过简单平均结合两种方法的预测的混合模型在排序和绝对 p 值方面均优于任何一种方法。使用两个公开的 FEP 基准测试,涵盖了 16 个不同的生物靶标,测试了协同作用的普遍性。通过专门针对相关配体的识别训练数据,使用在原始系列出版物发布时已知的配体活性数据,成功得出了准确的 QuanSA 模型。在 16 个基准目标上的结果表明,使用结合了 FEP 和 QuanSA 预测亲和力值的混合预测,在排序和绝对 p 值方面都有显著的提高。结果证明,对于亲和力预测,需要采用物理驱动的方法和机器学习驱动的方法相结合,每种方法都要谨慎地应用于合适的化合物,并且尽可能采用混合预测策略。