• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

最优分子设计:结合REINVENT与精确结合自由能排序模拟的生成式主动学习

Optimal Molecular Design: Generative Active Learning Combining REINVENT with Precise Binding Free Energy Ranking Simulations.

作者信息

Loeffler Hannes H, Wan Shunzhou, Klähn Marco, Bhati Agastya P, Coveney Peter V

机构信息

Molecular AI, Discovery Sciences, R&D, AstraZeneca, Mölndal 431 83, Sweden.

Centre for Computational Science, Department of Chemistry, University College London, London WC1H 0AJ, U.K.

出版信息

J Chem Theory Comput. 2024 Sep 3;20(18):8308-28. doi: 10.1021/acs.jctc.4c00576.

DOI:10.1021/acs.jctc.4c00576
PMID:39225482
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11428133/
Abstract

Active learning (AL) is a specific instance of sequential experimental design and uses machine learning to intelligently choose the next data point or batch of molecular structures to be evaluated. In this sense, it closely mimics the iterative design-make-test-analysis cycle of laboratory experiments to find optimized compounds for a given design task. Here, we describe an AL protocol which combines generative molecular AI, using REINVENT, and physics-based absolute binding free energy molecular dynamics simulation, using ESMACS, to discover new ligands for two different target proteins, 3CL and TNKS2. We have deployed our generative active learning (GAL) protocol on Frontier, the world's only exa-scale machine. We show that the protocol can find higher-scoring molecules compared to the baseline, a surrogate ML docking model for 3CL and compounds with experimentally determined binding affinities for TNKS2. The ligands found are also chemically diverse and occupy a different chemical space than the baseline. We vary the batch sizes that are put forward for free energy assessment in each GAL cycle to assess the impact on their efficiency on the GAL protocol and recommend their optimal values in different scenarios. Overall, we demonstrate a powerful capability of the combination of physics-based and AI methods which yields effective chemical space sampling at an unprecedented scale and is of immediate and direct relevance to modern, data-driven drug discovery.

摘要

主动学习(AL)是序列实验设计的一个具体实例,它利用机器学习智能地选择下一个要评估的数据点或一批分子结构。从这个意义上说,它紧密模仿了实验室实验的迭代设计-制造-测试-分析循环,以找到针对给定设计任务的优化化合物。在这里,我们描述了一种主动学习协议,该协议结合了使用REINVENT的生成式分子人工智能和使用ESMACS的基于物理的绝对结合自由能分子动力学模拟,以发现两种不同靶蛋白3CL和TNKS2的新配体。我们已经在世界上唯一的百亿亿次级计算机Frontier上部署了我们的生成式主动学习(GAL)协议。我们表明,与基线相比,该协议可以找到得分更高的分子,基线是3CL的替代机器学习对接模型以及具有实验确定的TNKS2结合亲和力的化合物。所发现的配体在化学上也具有多样性,并且占据与基线不同的化学空间。我们在每个GAL循环中改变提出用于自由能评估的批次大小,以评估其对GAL协议效率的影响,并推荐它们在不同场景下的最佳值。总体而言,我们展示了基于物理的方法和人工智能方法相结合的强大能力,这种结合以前所未有的规模实现了有效的化学空间采样,并且与现代数据驱动的药物发现直接相关。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b85c/11428133/7a1ec3362a2a/ct4c00576_0015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b85c/11428133/a6b3e097997c/ct4c00576_0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b85c/11428133/b8a75f83e0b2/ct4c00576_0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b85c/11428133/6254b174cf29/ct4c00576_0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b85c/11428133/5ec2ddc5e13f/ct4c00576_0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b85c/11428133/c47fbbdb064d/ct4c00576_0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b85c/11428133/f9cf89cb201f/ct4c00576_0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b85c/11428133/6dcd75f2112e/ct4c00576_0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b85c/11428133/24587fe56fad/ct4c00576_0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b85c/11428133/43cce740391d/ct4c00576_0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b85c/11428133/1c2088bd586d/ct4c00576_0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b85c/11428133/d2f4c4fa549d/ct4c00576_0011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b85c/11428133/c4886bb104f1/ct4c00576_0012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b85c/11428133/d82586214eaf/ct4c00576_0013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b85c/11428133/263a89d5144d/ct4c00576_0014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b85c/11428133/7a1ec3362a2a/ct4c00576_0015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b85c/11428133/a6b3e097997c/ct4c00576_0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b85c/11428133/b8a75f83e0b2/ct4c00576_0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b85c/11428133/6254b174cf29/ct4c00576_0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b85c/11428133/5ec2ddc5e13f/ct4c00576_0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b85c/11428133/c47fbbdb064d/ct4c00576_0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b85c/11428133/f9cf89cb201f/ct4c00576_0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b85c/11428133/6dcd75f2112e/ct4c00576_0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b85c/11428133/24587fe56fad/ct4c00576_0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b85c/11428133/43cce740391d/ct4c00576_0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b85c/11428133/1c2088bd586d/ct4c00576_0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b85c/11428133/d2f4c4fa549d/ct4c00576_0011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b85c/11428133/c4886bb104f1/ct4c00576_0012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b85c/11428133/d82586214eaf/ct4c00576_0013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b85c/11428133/263a89d5144d/ct4c00576_0014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b85c/11428133/7a1ec3362a2a/ct4c00576_0015.jpg

相似文献

1
Optimal Molecular Design: Generative Active Learning Combining REINVENT with Precise Binding Free Energy Ranking Simulations.最优分子设计:结合REINVENT与精确结合自由能排序模拟的生成式主动学习
J Chem Theory Comput. 2024 Sep 3;20(18):8308-28. doi: 10.1021/acs.jctc.4c00576.
2
Identification of novel hit molecules targeting M. tuberculosis polyketide synthase 13 by combining generative AI and physics-based methods.通过生成式人工智能与基于物理方法相结合,鉴定新型靶向结核分枝杆菌聚酮合酶 13 的命中分子。
Comput Biol Med. 2024 Jun;176:108573. doi: 10.1016/j.compbiomed.2024.108573. Epub 2024 May 8.
3
Macromolecular crowding: chemistry and physics meet biology (Ascona, Switzerland, 10-14 June 2012).大分子拥挤现象:化学与物理邂逅生物学(瑞士阿斯科纳,2012年6月10日至14日)
Phys Biol. 2013 Aug;10(4):040301. doi: 10.1088/1478-3975/10/4/040301. Epub 2013 Aug 2.
4
Machine learning in computational docking.计算对接中的机器学习。
Artif Intell Med. 2015 Mar;63(3):135-52. doi: 10.1016/j.artmed.2015.02.002. Epub 2015 Feb 16.
5
Optimization of binding affinities in chemical space with generative pre-trained transformer and deep reinforcement learning.利用生成式预训练变换器和深度强化学习在化学空间中优化结合亲和力
F1000Res. 2024 Feb 20;12:757. doi: 10.12688/f1000research.130936.2. eCollection 2023.
6
Reinvent 4: Modern AI-driven generative molecule design.重塑4:现代人工智能驱动的生成式分子设计。
J Cheminform. 2024 Feb 21;16(1):20. doi: 10.1186/s13321-024-00812-5.
7
Optimizing interactions to protein binding sites by integrating docking-scoring strategies into generative AI methods.通过将对接评分策略整合到生成式人工智能方法中,优化与蛋白质结合位点的相互作用。
Front Chem. 2022 Oct 19;10:1012507. doi: 10.3389/fchem.2022.1012507. eCollection 2022.
8
Combining Cloud-Based Free-Energy Calculations, Synthetically Aware Enumerations, and Goal-Directed Generative Machine Learning for Rapid Large-Scale Chemical Exploration and Optimization.结合基于云的自由能计算、综合感知枚举和目标导向的生成式机器学习,实现快速大规模化学探索和优化。
J Chem Inf Model. 2020 Sep 28;60(9):4311-4325. doi: 10.1021/acs.jcim.0c00120. Epub 2020 Jun 19.
9
Generative organic electronic molecular design informed by quantum chemistry.由量子化学推动的有机电子分子生成设计。
Chem Sci. 2023 Sep 13;14(40):11045-11055. doi: 10.1039/d3sc03781a. eCollection 2023 Oct 18.
10
Ensemble Simulations and Experimental Free Energy Distributions: Evaluation and Characterization of Isoxazole Amides as SMYD3 Inhibitors.整体模拟和实验自由能分布:作为 SMYD3 抑制剂的异噁唑酰胺的评估和特征分析。
J Chem Inf Model. 2022 May 23;62(10):2561-2570. doi: 10.1021/acs.jcim.2c00255. Epub 2022 May 4.

引用本文的文献

1
Generative design of singlet fission materials leveraging a fragment-oriented database.利用面向片段的数据库进行单线态裂变材料的生成式设计。
Chem Sci. 2025 Aug 25. doi: 10.1039/d5sc03184b.
2
Optimizing drug design by merging generative AI with a physics-based active learning framework.通过将生成式人工智能与基于物理学的主动学习框架相结合来优化药物设计。
Commun Chem. 2025 Aug 8;8(1):238. doi: 10.1038/s42004-025-01635-7.
3
Efficient Design of Affilin Protein Binders for HER3.HER3亲和蛋白结合剂的高效设计

本文引用的文献

1
Machine Learning Guided AQFEP: A Fast and Efficient Absolute Free Energy Perturbation Solution for Virtual Screening.机器学习引导的AQFEP:一种用于虚拟筛选的快速高效的绝对自由能微扰解决方案。
J Chem Theory Comput. 2024 Aug 15;20(16):7188-98. doi: 10.1021/acs.jctc.4c00399.
2
Evaluation of reinforcement learning in transformer-based molecular design.基于Transformer的分子设计中强化学习的评估
J Cheminform. 2024 Aug 8;16(1):95. doi: 10.1186/s13321-024-00887-0.
3
QSARtuna: An Automated QSAR Modeling Platform for Molecular Property Prediction in Drug Design.
Int J Mol Sci. 2025 May 14;26(10):4683. doi: 10.3390/ijms26104683.
4
Molecular Dynamics as a Precision Therapy: A Perspective on Epileptic Encephalopathies.分子动力学作为一种精准疗法:对癫痫性脑病的见解
Mol Diagn Ther. 2025 Apr 11. doi: 10.1007/s40291-025-00780-y.
5
DeepPath: Overcoming data scarcity for protein transition pathway prediction using physics-based deep learning.深度路径:利用基于物理学的深度学习克服蛋白质转变途径预测中的数据稀缺问题。
bioRxiv. 2025 Mar 2:2025.02.27.640693. doi: 10.1101/2025.02.27.640693.
QSARtuna:一个用于药物设计中分子性质预测的自动化 QSAR 建模平台。
J Chem Inf Model. 2024 Jul 22;64(14):5365-5374. doi: 10.1021/acs.jcim.4c00457. Epub 2024 Jul 1.
4
Advancing material property prediction: using physics-informed machine learning models for viscosity.推进材料性能预测:使用物理信息机器学习模型预测粘度。
J Cheminform. 2024 Mar 14;16(1):31. doi: 10.1186/s13321-024-00820-5.
5
Benchmarking Active Learning Protocols for Ligand-Binding Affinity Prediction.基于配体结合亲和力预测的主动学习协议基准测试。
J Chem Inf Model. 2024 Mar 25;64(6):1955-1965. doi: 10.1021/acs.jcim.4c00220. Epub 2024 Mar 6.
6
Combining IC or Values from Different Sources Is a Source of Significant Noise.合并来自不同来源的IC或值是显著噪声的一个来源。
J Chem Inf Model. 2024 Mar 11;64(5):1560-1567. doi: 10.1021/acs.jcim.4c00049. Epub 2024 Feb 23.
7
Reinvent 4: Modern AI-driven generative molecule design.重塑4:现代人工智能驱动的生成式分子设计。
J Cheminform. 2024 Feb 21;16(1):20. doi: 10.1186/s13321-024-00812-5.
8
Regression-Based Active Learning for Accessible Acceleration of Ultra-Large Library Docking.基于回归的主动学习可实现超大库对接的无障碍加速。
J Chem Inf Model. 2024 Apr 8;64(7):2612-2623. doi: 10.1021/acs.jcim.3c01661. Epub 2023 Dec 29.
9
Chemprop: A Machine Learning Package for Chemical Property Prediction.Chemprop:一个用于化学性质预测的机器学习工具包。
J Chem Inf Model. 2024 Jan 8;64(1):9-17. doi: 10.1021/acs.jcim.3c01250. Epub 2023 Dec 26.
10
Ensemble-Based Approaches Ensure Reliability and Reproducibility.基于集成的方法可确保可靠性和可重复性。
J Chem Inf Model. 2023 Nov 27;63(22):6959-6963. doi: 10.1021/acs.jcim.3c01654. Epub 2023 Nov 15.