主动学习和非热力学自由能在化学空间探索中的应用。

Chemical Space Exploration with Active Learning and Alchemical Free Energies.

机构信息

Computational Biomolecular Dynamics Group, Department of Theoretical and Computational Biophysics, Max Planck Institute for Multidisciplinary Sciences, Am Fassberg 11, D-37077 Göttingen, Germany.

Computational Chemistry, Janssen Research & Development, Janssen Pharmaceutica N. V., Turnhoutseweg 30, 2340 Beerse, Belgium.

出版信息

J Chem Theory Comput. 2022 Oct 11;18(10):6259-6270. doi: 10.1021/acs.jctc.2c00752. Epub 2022 Sep 23.

DOI:10.1021/acs.jctc.2c00752

PMID:36148968

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9558370/

Abstract

Drug discovery can be thought of as a search for a needle in a haystack: searching through a large chemical space for the most active compounds. Computational techniques can narrow the search space for experimental follow up, but even they become unaffordable when evaluating large numbers of molecules. Therefore, machine learning (ML) strategies are being developed as computationally cheaper complementary techniques for navigating and triaging large chemical libraries. Here, we explore how an active learning protocol can be combined with first-principles based alchemical free energy calculations to identify high affinity phosphodiesterase 2 (PDE2) inhibitors. We first calibrate the procedure using a set of experimentally characterized PDE2 binders. The optimized protocol is then used prospectively on a large chemical library to navigate toward potent inhibitors. In the active learning cycle, at every iteration a small fraction of compounds is probed by alchemical calculations and the obtained affinities are used to train ML models. With successive rounds, high affinity binders are identified by explicitly evaluating only a small subset of compounds in a large chemical library, thus providing an efficient protocol that robustly identifies a large fraction of true positives.

摘要

药物发现可以被视为在干草堆中寻找针

在大量的化学空间中搜索最活跃的化合物。计算技术可以缩小实验后续的搜索空间，但当评估大量分子时，即使是这些技术也变得负担不起。因此，机器学习（ML）策略正在被开发为计算上更便宜的补充技术，用于导航和分类大型化学库。在这里，我们探索了如何将主动学习协议与基于第一性原理的量子化学自由能计算相结合，以鉴定高亲和力磷酸二酯酶 2（PDE2）抑制剂。我们首先使用一组经过实验表征的 PDE2 结合物对该程序进行校准。然后，该优化协议前瞻性地用于大型化学库中，以寻找有效的抑制剂。在主动学习循环中，在每次迭代中，一小部分化合物通过量子化学计算进行探测，并使用获得的亲和力来训练 ML 模型。通过连续几轮，通过明确评估大型化学库中的一小部分化合物，鉴定出高亲和力结合物，从而提供一种有效的方法，可以稳健地鉴定出大量的真正阳性结果。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f2f/9558370/dc863070cb46/ct2c00752_0001.jpg

相似文献

Chemical Space Exploration with Active Learning and Alchemical Free Energies.

J Chem Theory Comput. 2022 Oct 11;18(10):6259-6270. doi: 10.1021/acs.jctc.2c00752. Epub 2022 Sep 23.

Absolute Alchemical Free Energy Calculations for Ligand Binding: A Beginner's Guide.

Methods Mol Biol. 2018;1762:199-232. doi: 10.1007/978-1-4939-7756-7_11.

A Critical Review of Validation, Blind Testing, and Real- World Use of Alchemical Protein-Ligand Binding Free Energy Calculations.

Curr Top Med Chem. 2017;17(23):2577-2585. doi: 10.2174/1568026617666170414142131.

Reaction-Based Enumeration, Active Learning, and Free Energy Calculations To Rapidly Explore Synthetically Tractable Chemical Space and Optimize Potency of Cyclin-Dependent Kinase 2 Inhibitors.

J Chem Inf Model. 2019 Sep 23;59(9):3782-3793. doi: 10.1021/acs.jcim.9b00367. Epub 2019 Aug 22.

Protein-Ligand Binding Free Energy Calculations with FEP.

Methods Mol Biol. 2019;2022:201-232. doi: 10.1007/978-1-4939-9608-7_9.

SAMPL7 TrimerTrip host-guest binding affinities from extensive alchemical and end-point free energy calculations.

J Comput Aided Mol Des. 2021 Jan;35(1):117-129. doi: 10.1007/s10822-020-00351-9. Epub 2020 Oct 10.

Exploration of Ultralarge Compound Collections for Drug Discovery.

J Chem Inf Model. 2022 May 9;62(9):2021-2034. doi: 10.1021/acs.jcim.2c00224. Epub 2022 Apr 14.

Efficient search of chemical space: navigating from fragments to structurally diverse chemotypes.

J Med Chem. 2013 Nov 14;56(21):8879-91. doi: 10.1021/jm401309q. Epub 2013 Oct 31.

Exploration of a Large Virtual Chemical Space: Identification of Potent Inhibitors of Lactate Dehydrogenase-A against Pancreatic Cancer.

J Chem Inf Model. 2023 Feb 13;63(3):1028-1043. doi: 10.1021/acs.jcim.2c01544. Epub 2023 Jan 16.

DNA-encoded chemical libraries: advancing beyond conventional small-molecule libraries.

Acc Chem Res. 2014 Apr 15;47(4):1247-55. doi: 10.1021/ar400284t. Epub 2014 Mar 28.

引用本文的文献

Advancing genetic engineering with active learning: theory, implementations and potential opportunities.

Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf286.

PAL - parallel active learning for machine-learned potentials.

Digit Discov. 2025 Jun 22. doi: 10.1039/d5dd00073d.

Scaffold Hopping with Generative Reinforcement Learning.

J Chem Inf Model. 2025 Jul 14;65(13):6513-6525. doi: 10.1021/acs.jcim.5c00029. Epub 2025 Jun 26.

Active Learning FEP Using 3D-QSAR for Prioritizing Bioisosteres in Medicinal Chemistry.

ACS Med Chem Lett. 2025 Apr 29;16(6):984-990. doi: 10.1021/acsmedchemlett.4c00554. eCollection 2025 Jun 12.

Acceleration of the GROMACS Free-Energy Perturbation Calculations on GPUs.

ACS Omega. 2025 May 30;10(22):22858-22873. doi: 10.1021/acsomega.5c00151. eCollection 2025 Jun 10.

Active Learning Improves Ionization Efficiency Predictions and Quantification in Nontargeted LC/HRMS.

Anal Chem. 2025 Jul 1;97(25):13131-13139. doi: 10.1021/acs.analchem.5c00816. Epub 2025 Jun 13.

Predicting high-fitness viral protein variants with Bayesian active learning and biophysics.

Proc Natl Acad Sci U S A. 2025 Jun 17;122(24):e2503742122. doi: 10.1073/pnas.2503742122. Epub 2025 Jun 9.

Automated On-the-Fly Optimization of Resource Allocation for Efficient Free Energy Simulations.

J Chem Inf Model. 2025 May 26;65(10):4932-4951. doi: 10.1021/acs.jcim.4c02107. Epub 2025 May 6.

Few-Shot Viral Variant Detection via Bayesian Active Learning and Biophysics.

bioRxiv. 2025 Mar 13:2025.03.12.642881. doi: 10.1101/2025.03.12.642881.

Prospective evaluation of structure-based simulations reveal their ability to predict the impact of kinase mutations on inhibitor binding.

bioRxiv. 2025 Mar 1:2024.11.15.623861. doi: 10.1101/2024.11.15.623861.

本文引用的文献

Active Learning Guided Drug Design Lead Optimization Based on Relative Binding Free Energy Modeling.

J Chem Inf Model. 2023 Jan 23;63(2):583-594. doi: 10.1021/acs.jcim.2c01052. Epub 2023 Jan 4.

Best practices for constructing, preparing, and evaluating protein-ligand binding affinity benchmarks [Article v0.1].

Living J Comput Mol Sci. 2022;4(1). doi: 10.33011/livecoms.4.1.1497. Epub 2022 Aug 30.

Data-driven discovery of cardiolipin-selective small molecules by computational active learning.

Chem Sci. 2022 Mar 2;13(16):4498-4511. doi: 10.1039/d2sc00116k. eCollection 2022 Apr 20.

On the Frustration to Predict Binding Affinities from Protein-Ligand Structures with Deep Neural Networks.

J Med Chem. 2022 Jun 9;65(11):7946-7958. doi: 10.1021/acs.jmedchem.2c00487. Epub 2022 May 24.

GROMACS in the Cloud: A Global Supercomputer to Speed Up Alchemical Drug Design.

J Chem Inf Model. 2022 Apr 11;62(7):1691-1711. doi: 10.1021/acs.jcim.2c00044. Epub 2022 Mar 30.

Pre-Exascale Computing of Protein-Ligand Binding Free Energies with Open Source Software for Drug Design.

J Chem Inf Model. 2022 Mar 14;62(5):1172-1177. doi: 10.1021/acs.jcim.1c01445. Epub 2022 Feb 22.

The Impact of Experimental and Calculated Error on the Performance of Affinity Predictions.

J Chem Inf Model. 2022 Feb 14;62(3):703-717. doi: 10.1021/acs.jcim.1c01214. Epub 2022 Jan 21.

Efficient Exploration of Chemical Space with Docking and Deep Learning.

J Chem Theory Comput. 2021 Nov 9;17(11):7106-7119. doi: 10.1021/acs.jctc.1c00810. Epub 2021 Sep 30.

Best Practices for Alchemical Free Energy Calculations [Article v1.0].

Living J Comput Mol Sci. 2020;2(1). doi: 10.33011/livecoms.2.1.18378.

Large scale relative protein ligand binding affinities using non-equilibrium alchemy.

Chem Sci. 2019 Dec 2;11(4):1140-1152. doi: 10.1039/c9sc03754c.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

主动学习和非热力学自由能在化学空间探索中的应用。

Chemical Space Exploration with Active Learning and Alchemical Free Energies.

机构信息

Computational Biomolecular Dynamics Group, Department of Theoretical and Computational Biophysics, Max Planck Institute for Multidisciplinary Sciences, Am Fassberg 11, D-37077 Göttingen, Germany.

Computational Chemistry, Janssen Research & Development, Janssen Pharmaceutica N. V., Turnhoutseweg 30, 2340 Beerse, Belgium.

出版信息

J Chem Theory Comput. 2022 Oct 11;18(10):6259-6270. doi: 10.1021/acs.jctc.2c00752. Epub 2022 Sep 23.

DOI:10.1021/acs.jctc.2c00752

PMID:36148968

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9558370/

Abstract

摘要

主动学习和非热力学自由能在化学空间探索中的应用。

Chemical Space Exploration with Active Learning and Alchemical Free Energies.

机构信息

出版信息

药物发现可以被视为在干草堆中寻找针

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

主动学习和非热力学自由能在化学空间探索中的应用。

Chemical Space Exploration with Active Learning and Alchemical Free Energies.

机构信息

出版信息

药物发现可以被视为在干草堆中寻找针

相似文献

引用本文的文献

本文引用的文献