分子动力学模拟能否通过机器学习改进对蛋白质-配体结合亲和力的预测？

Can molecular dynamics simulations improve predictions of protein-ligand binding affinity with machine learning?

作者信息

Gu Shukai, Shen Chao, Yu Jiahui, Zhao Hong, Liu Huanxiang, Liu Liwei, Sheng Rong, Xu Lei, Wang Zhe, Hou Tingjun, Kang Yu

机构信息

Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China.

Faculty of Applied Science, Macao Polytechnic University, Macao, SAR, China.

出版信息

Brief Bioinform. 2023 Mar 19;24(2). doi: 10.1093/bib/bbad008.

DOI:10.1093/bib/bbad008

PMID:36681903

Abstract

Binding affinity prediction largely determines the discovery efficiency of lead compounds in drug discovery. Recently, machine learning (ML)-based approaches have attracted much attention in hopes of enhancing the predictive performance of traditional physics-based approaches. In this study, we evaluated the impact of structural dynamic information on the binding affinity prediction by comparing the models trained on different dimensional descriptors, using three targets (i.e. JAK1, TAF1-BD2 and DDR1) and their corresponding ligands as the examples. Here, 2D descriptors are traditional ECFP4 fingerprints, 3D descriptors are the energy terms of the Smina and NNscore scoring functions and 4D descriptors contain the structural dynamic information derived from the trajectories based on molecular dynamics (MD) simulations. We systematically investigate the MD-refined binding affinity prediction performance of three classical ML algorithms (i.e. RF, SVR and XGB) as well as two common virtual screening methods, namely Glide docking and MM/PBSA. The outcomes of the ML models built using various dimensional descriptors and their combinations reveal that the MD refinement with the optimized protocol can improve the predictive performance on the TAF1-BD2 target with considerable structural flexibility, but not for the less flexible JAK1 and DDR1 targets, when taking docking poses as the initial structure instead of the crystal structures. The results highlight the importance of the initial structures to the final performance of the model through conformational analysis on the three targets with different flexibility.

摘要

结合亲和力预测在很大程度上决定了药物研发中先导化合物的发现效率。近年来，基于机器学习（ML）的方法备受关注，有望提高传统基于物理方法的预测性能。在本研究中，我们以三个靶点（即JAK1、TAF1-BD2和DDR1）及其相应配体为例，通过比较在不同维度描述符上训练的模型，评估了结构动态信息对结合亲和力预测的影响。这里，二维描述符是传统的ECFP4指纹，三维描述符是Smina和NNscore评分函数的能量项，四维描述符包含基于分子动力学（MD）模拟轨迹得出的结构动态信息。我们系统地研究了三种经典机器学习算法（即随机森林、支持向量回归和极端梯度提升）以及两种常见虚拟筛选方法（即Glide对接和MM/PBSA）的MD优化结合亲和力预测性能。使用各种维度描述符及其组合构建的机器学习模型的结果表明，当以对接构象而非晶体结构作为初始结构时，采用优化方案的MD优化可以提高对具有相当结构灵活性的TAF1-BD2靶点的预测性能，但对灵活性较低的JAK1和DDR1靶点则不然。通过对三个具有不同灵活性的靶点进行构象分析，结果突出了初始结构对模型最终性能的重要性。

相似文献

Can molecular dynamics simulations improve predictions of protein-ligand binding affinity with machine learning?

Brief Bioinform. 2023 Mar 19;24(2). doi: 10.1093/bib/bbad008.

SCORCH: Improving structure-based virtual screening with machine learning classifiers, data augmentation, and uncertainty estimation.

J Adv Res. 2023 Apr;46:135-147. doi: 10.1016/j.jare.2022.07.001. Epub 2022 Jul 25.

Empirical Scoring Functions for Affinity Prediction of Protein-ligand Complexes.

Mol Inform. 2016 Dec;35(11-12):541-548. doi: 10.1002/minf.201600048. Epub 2016 Jul 8.

Task-Specific Scoring Functions for Predicting Ligand Binding Poses and Affinity and for Screening Enrichment.

J Chem Inf Model. 2018 Jan 22;58(1):119-133. doi: 10.1021/acs.jcim.7b00309. Epub 2017 Dec 20.

Rescoring of docking poses under Occam's Razor: are there simpler solutions?

J Comput Aided Mol Des. 2018 Sep;32(9):877-888. doi: 10.1007/s10822-018-0155-5. Epub 2018 Sep 1.

Boosted neural networks scoring functions for accurate ligand docking and ranking.

J Bioinform Comput Biol. 2018 Apr;16(2):1850004. doi: 10.1142/S021972001850004X. Epub 2018 Feb 4.

binding affinity prediction for metabotropic glutamate receptors using both endpoint free energy methods and a machine learning-based scoring function.

Phys Chem Chem Phys. 2022 Aug 3;24(30):18291-18305. doi: 10.1039/d2cp01727j.

Geometry Optimization Algorithms in Conjunction with the Machine Learning Potential ANI-2x Facilitate the Structure-Based Virtual Screening and Binding Mode Prediction.

Biomolecules. 2024 May 31;14(6):648. doi: 10.3390/biom14060648.

Beware of machine learning-based scoring functions-on the danger of developing black boxes.

J Chem Inf Model. 2014 Oct 27;54(10):2807-15. doi: 10.1021/ci500406k. Epub 2014 Sep 24.

Machine learning in computational docking.

Artif Intell Med. 2015 Mar;63(3):135-52. doi: 10.1016/j.artmed.2015.02.002. Epub 2015 Feb 16.

引用本文的文献

Spatio-temporal learning from molecular dynamics simulations for protein-ligand binding affinity prediction.

Bioinformatics. 2025 Aug 2;41(8). doi: 10.1093/bioinformatics/btaf429.

Effects of Monoterpene-Based Biostimulants on Chickpea ( L.) Plants: Functional and Molecular Insights.

Biology (Basel). 2025 Jun 5;14(6):657. doi: 10.3390/biology14060657.

SeqDance: A Protein Language Model for Representing Protein Dynamic Properties.

bioRxiv. 2024 Oct 15:2024.10.11.617911. doi: 10.1101/2024.10.11.617911.

A comprehensive review of artificial intelligence for pharmacology research.

Front Genet. 2024 Sep 3;15:1450529. doi: 10.3389/fgene.2024.1450529. eCollection 2024.

Assessment of machine learning models trained by molecular dynamics simulations results for inferring ethanol adsorption on an aluminium surface.

Sci Rep. 2024 Sep 3;14(1):20437. doi: 10.1038/s41598-024-71007-z.

Unsupervised deep learning for molecular dynamics simulations: a novel analysis of protein-ligand interactions in SARS-CoV-2 M.

RSC Adv. 2023 Nov 22;13(48):34249-34261. doi: 10.1039/d3ra06375e. eCollection 2023 Nov 16.

Discovery of Nonretinoid Inhibitors of CRBP1: Structural and Dynamic Insights for Ligand-Binding Mechanisms.

ACS Chem Biol. 2023 Oct 20;18(10):2309-2323. doi: 10.1021/acschembio.3c00402. Epub 2023 Sep 15.

Neural networks prediction of the protein-ligand binding affinity with circular fingerprints.

Technol Health Care. 2023;31(S1):487-495. doi: 10.3233/THC-236042.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

分子动力学模拟能否通过机器学习改进对蛋白质-配体结合亲和力的预测？

Can molecular dynamics simulations improve predictions of protein-ligand binding affinity with machine learning?

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献