深度学习在配体对接中的应用：虚拟筛选的挑战与展望。

Advancing Ligand Docking through Deep Learning: Challenges and Prospects in Virtual Screening.

机构信息

College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China.

Hangzhou Carbonsilicon AI Technology Co., Ltd, Hangzhou 310018, Zhejiang, China.

出版信息

Acc Chem Res. 2024 May 21;57(10):1500-1509. doi: 10.1021/acs.accounts.4c00093. Epub 2024 Apr 5.

DOI:10.1021/acs.accounts.4c00093

PMID:38577892

Abstract

Molecular docking, also termed ligand docking (LD), is a pivotal element of structure-based virtual screening (SBVS) used to predict the binding conformations and affinities of protein-ligand complexes. Traditional LD methodologies rely on a search and scoring framework, utilizing heuristic algorithms to explore binding conformations and scoring functions to evaluate binding strengths. However, to meet the efficiency demands of SBVS, these algorithms and functions are often simplified, prioritizing speed over accuracy.The emergence of deep learning (DL) has exerted a profound impact on diverse fields, ranging from natural language processing to computer vision and drug discovery. DeepMind's AlphaFold2 has impressively exhibited its ability to accurately predict protein structures solely from amino acid sequences, highlighting the remarkable potential of DL in conformation prediction. This groundbreaking advancement circumvents the traditional search-scoring frameworks in LD, enhancing both accuracy and processing speed and thereby catalyzing a broader adoption of DL algorithms in binding pose prediction. Nevertheless, a consensus on certain aspects remains elusive.In this Account, we delineate the current status of employing DL to augment LD within the VS paradigm, highlighting our contributions to this domain. Furthermore, we discuss the challenges and future prospects, drawing insights from our scholarly investigations. Initially, we present an overview of VS and LD, followed by an introduction to DL paradigms, which deviate significantly from traditional search-scoring frameworks. Subsequently, we delve into the challenges associated with the development of DL-based LD (DLLD), encompassing evaluation metrics, application scenarios, and physical plausibility of the predicted conformations. In the evaluation of LD algorithms, it is essential to recognize the multifaceted nature of the metrics. While the accuracy of binding pose prediction, often measured by the success rate, is a pivotal aspect, the scoring/screening power and computational speed of these algorithms are equally important given the pivotal role of LD tools in VS. Regarding application scenarios, early methods focused on blind docking, where the binding site is unknown. However, recent studies suggest a shift toward identifying binding sites rather than solely predicting binding poses within these models. In contrast, LD with a known pocket in VS has been shown to be more practical. Physical plausibility poses another significant challenge. Although DLLD models often achieve higher success rates compared to traditional methods, they may generate poses with implausible local structures, such as incorrect bond angles or lengths, which are disadvantageous for postprocessing tasks like visualization. Finally, we discuss the future perspectives for DLLD, emphasizing the need to improve generalization ability, strike a balance between speed and accuracy, account for protein conformation flexibility, and enhance physical plausibility. Additionally, we delve into the comparison between generative and regression algorithms in this context, exploring their respective strengths and potential.

摘要

分子对接，也称为配体对接（LD），是基于结构的虚拟筛选（SBVS）的一个关键组成部分，用于预测蛋白质-配体复合物的结合构象和亲和力。传统的 LD 方法依赖于搜索和评分框架，使用启发式算法来探索结合构象，使用评分函数来评估结合强度。然而，为了满足 SBVS 的效率要求，这些算法和函数通常被简化，优先考虑速度而不是准确性。深度学习（DL）的出现对自然语言处理、计算机视觉和药物发现等多个领域产生了深远的影响。DeepMind 的 AlphaFold2 令人印象深刻地展示了仅从氨基酸序列准确预测蛋白质结构的能力，突出了 DL 在构象预测方面的巨大潜力。这一开创性的进展绕过了 LD 中的传统搜索-评分框架，提高了准确性和处理速度，从而促进了 DL 算法在结合构象预测中的更广泛应用。然而，在某些方面仍然缺乏共识。在本账户中，我们描述了在 VS 范式中使用 DL 增强 LD 的现状，强调了我们在这一领域的贡献。此外，我们还讨论了挑战和未来展望，从我们的学术研究中汲取了灵感。首先，我们介绍了 VS 和 LD 的概述，然后介绍了与传统搜索-评分框架有很大不同的 DL 范式。随后，我们深入探讨了基于 DL 的 LD（DLLD）发展所面临的挑战，包括评估指标、应用场景和预测构象的物理合理性。在 LD 算法的评估中，必须认识到指标的多面性。虽然结合构象预测的准确性，通常通过成功率来衡量，是一个关键方面，但这些算法的评分/筛选能力和计算速度同样重要，因为 LD 工具在 VS 中起着关键作用。关于应用场景，早期的方法侧重于盲对接，即不知道结合位点。然而，最近的研究表明，这些模型中的一个趋势是从识别结合位点转变为不仅仅预测结合构象。相比之下，VS 中具有已知口袋的 LD 已被证明更具实用性。物理合理性是另一个重大挑战。尽管 DLLD 模型通常比传统方法获得更高的成功率，但它们可能会生成具有不合理局部结构的构象，例如不正确的键角或长度，这不利于后处理任务，如可视化。最后，我们讨论了 DLLD 的未来展望，强调需要提高泛化能力、在速度和准确性之间取得平衡、考虑蛋白质构象的灵活性，并提高物理合理性。此外，我们还探讨了在这种情况下生成算法和回归算法之间的比较，探索了它们各自的优势和潜力。

相似文献

Advancing Ligand Docking through Deep Learning: Challenges and Prospects in Virtual Screening.深度学习在配体对接中的应用：虚拟筛选的挑战与展望。

Acc Chem Res. 2024 May 21;57(10):1500-1509. doi: 10.1021/acs.accounts.4c00093. Epub 2024 Apr 5.

Harnessing deep learning for enhanced ligand docking.利用深度学习增强配体对接。

Trends Pharmacol Sci. 2024 Feb;45(2):103-106. doi: 10.1016/j.tips.2023.12.004. Epub 2023 Dec 30.

A fully differentiable ligand pose optimization framework guided by deep learning and a traditional scoring function.一个由深度学习和传统评分函数引导的完全可微配体构象优化框架。

Brief Bioinform. 2023 Jan 19;24(1). doi: 10.1093/bib/bbac520.

Improving docking results via reranking of ensembles of ligand poses in multiple X-ray protein conformations with MM-GBSA.通过使用 MM-GBSA 对多个 X 射线蛋白质构象中的配体构象进行重新排序，从而提高对接结果。

J Chem Inf Model. 2014 Oct 27;54(10):2697-717. doi: 10.1021/ci5003735. Epub 2014 Sep 30.

Boosted neural networks scoring functions for accurate ligand docking and ranking.用于精确配体对接和排序的增强神经网络评分函数。

J Bioinform Comput Biol. 2018 Apr;16(2):1850004. doi: 10.1142/S021972001850004X. Epub 2018 Feb 4.

Task-Specific Scoring Functions for Predicting Ligand Binding Poses and Affinity and for Screening Enrichment.用于预测配体结合构象和亲和力以及进行筛选富集的任务特定评分函数。

J Chem Inf Model. 2018 Jan 22;58(1):119-133. doi: 10.1021/acs.jcim.7b00309. Epub 2017 Dec 20.

Deep Learning Model for Efficient Protein-Ligand Docking with Implicit Side-Chain Flexibility.具有隐式侧链灵活性的高效蛋白质-配体对接深度学习模型。

J Chem Inf Model. 2023 Mar 27;63(6):1695-1707. doi: 10.1021/acs.jcim.2c01436. Epub 2023 Mar 14.

Advances in Docking.对接技术的新进展。

Curr Med Chem. 2019;26(42):7555-7580. doi: 10.2174/0929867325666180904115000.

ViTScore: A Novel Three-Dimensional Vision Transformer Method for Accurate Prediction of Protein-Ligand Docking Poses.ViTScore：一种用于准确预测蛋白质-配体对接构象的新型三维视觉Transformer 方法。

IEEE Trans Nanobioscience. 2023 Oct;22(4):734-743. doi: 10.1109/TNB.2023.3274640. Epub 2023 Oct 3.

Machine learning in computational docking.计算对接中的机器学习。

Artif Intell Med. 2015 Mar;63(3):135-52. doi: 10.1016/j.artmed.2015.02.002. Epub 2015 Feb 16.

引用本文的文献

Decoding the limits of deep learning in molecular docking for drug discovery.解码深度学习在药物发现分子对接中的局限性。

Chem Sci. 2025 Aug 19. doi: 10.1039/d5sc05395a.

The future of pharmaceuticals: Artificial intelligence in drug discovery and development.制药的未来：药物研发中的人工智能

J Pharm Anal. 2025 Aug;15(8):101248. doi: 10.1016/j.jpha.2025.101248. Epub 2025 Feb 26.

AlphaFold 3: an unprecedent opportunity for fundamental research and drug development.阿尔法折叠3：基础研究和药物开发的前所未有的机遇。

Precis Clin Med. 2025 Jul 1;8(3):pbaf015. doi: 10.1093/pcmedi/pbaf015. eCollection 2025 Sep.

Artificial intelligence-driven discovery of YH395A: A novel TGFβR1 inhibitor with potent anti-tumor activity against triple-negative breast cancer.人工智能驱动发现YH395A：一种对三阴性乳腺癌具有强效抗肿瘤活性的新型转化生长因子β受体1（TGFβR1）抑制剂。

Cell Commun Signal. 2025 Jul 8;23(1):326. doi: 10.1186/s12964-025-02337-2.

Ten quick tips to perform meaningful and reproducible molecular docking calculations.进行有意义且可重复的分子对接计算的十条快速提示。

PLoS Comput Biol. 2025 May 9;21(5):e1013030. doi: 10.1371/journal.pcbi.1013030. eCollection 2025 May.

Integrating Machine Learning-Based Pose Sampling with Established Scoring Functions for Virtual Screening.将基于机器学习的姿势采样与既定评分函数相结合用于虚拟筛选。

J Chem Inf Model. 2025 May 26;65(10):4833-4843. doi: 10.1021/acs.jcim.5c00380. Epub 2025 May 9.

Principles and Design of Molecular Tools for Sensing and Perturbing Cell Surface Receptor Activity.用于感知和扰动细胞表面受体活性的分子工具的原理与设计

Chem Rev. 2025 Mar 12;125(5):2665-2702. doi: 10.1021/acs.chemrev.4c00582. Epub 2025 Feb 25.

De novo in silico screening of natural products for antidiabetic drug discovery: ADMET profiling, molecular docking, and molecular dynamics simulations.用于抗糖尿病药物发现的天然产物从头虚拟筛选：ADMET 分析、分子对接和分子动力学模拟。

In Silico Pharmacol. 2025 Feb 17;13(1):29. doi: 10.1007/s40203-025-00320-w. eCollection 2025.

SurfDock is a surface-informed diffusion generative model for reliable and accurate protein-ligand complex prediction.SurfDock是一种基于表面信息的扩散生成模型，用于可靠且准确地预测蛋白质-配体复合物。

Nat Methods. 2025 Feb;22(2):310-322. doi: 10.1038/s41592-024-02516-y. Epub 2024 Nov 27.

OpenDock: a pytorch-based open-source framework for protein-ligand docking and modelling.OpenDock：一个基于 PyTorch 的开源蛋白质-配体对接和建模框架。

Bioinformatics. 2024 Nov 1;40(11). doi: 10.1093/bioinformatics/btae628.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

深度学习在配体对接中的应用：虚拟筛选的挑战与展望。

Advancing Ligand Docking through Deep Learning: Challenges and Prospects in Virtual Screening.

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献