• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

结合基于云的自由能计算、综合感知枚举和目标导向的生成式机器学习,实现快速大规模化学探索和优化。

Combining Cloud-Based Free-Energy Calculations, Synthetically Aware Enumerations, and Goal-Directed Generative Machine Learning for Rapid Large-Scale Chemical Exploration and Optimization.

机构信息

Schrödinger, Inc., 120 West 45th Street, 17th floor, New York, New York 10036, United States.

出版信息

J Chem Inf Model. 2020 Sep 28;60(9):4311-4325. doi: 10.1021/acs.jcim.0c00120. Epub 2020 Jun 19.

DOI:10.1021/acs.jcim.0c00120
PMID:32484669
Abstract

The hit identification process usually involves the profiling of millions to more recently billions of compounds either via traditional experimental high-throughput screens (HTS) or computational virtual high-throughput screens (vHTS). We have previously demonstrated that, by coupling reaction-based enumeration, active learning, and free energy calculations, a similarly large-scale exploration of chemical space can be extended to the hit-to-lead process. In this work, we augment that approach by coupling large scale enumeration and cloud-based free energy perturbation (FEP) profiling with goal-directed generative machine learning, which results in a higher enrichment of potent ideas compared to large scale enumeration alone, while simultaneously staying within the bounds of predefined drug-like property space. We can achieve this by building the molecular distribution for generative machine learning from the PathFinder rules-based enumeration and optimizing for a weighted sum QSAR-based multiparameter optimization function. We examine the utility of this combined approach by designing potent inhibitors of cyclin-dependent kinase 2 (CDK2) and demonstrate a coupled workflow that can (1) provide a 6.4-fold enrichment improvement in identifying <10 nM compounds over random selection and a 1.5-fold enrichment in identifying <10 nM compounds over our previous method, (2) rapidly explore relevant chemical space outside the bounds of commercial reagents, (3) use generative ML approaches to "learn" the SAR from large scale in silico enumerations and generate novel idea molecules for a flexible receptor site that are both potent and within relevant physicochemical space, and (4) produce over 3 000 000 idea molecules and run 1935 FEP simulations, identifying 69 ideas with a predicted IC < 10 nM and 358 ideas with a predicted IC < 100 nM. The reported data suggest combining both reaction-based and generative machine learning for ideation results in a higher enrichment of potent compounds over previously described approaches and has the potential to rapidly accelerate the discovery of novel chemical matter within a predefined potency and property space.

摘要

命中鉴定过程通常涉及通过传统的实验高通量筛选(HTS)或计算虚拟高通量筛选(vHTS)对数百万种甚至最近数十亿种化合物进行分析。我们之前已经证明,通过结合基于反应的枚举、主动学习和自由能计算,可以将同样大规模的化学空间探索扩展到从命中到先导的过程。在这项工作中,我们通过将大规模枚举和基于云的自由能扰动(FEP)分析与目标导向的生成式机器学习相结合来扩展该方法,与仅进行大规模枚举相比,这可以更高地富集有效化合物,同时保持在预定义的类药性空间范围内。我们可以通过从 PathFinder 基于规则的枚举中构建生成式机器学习的分子分布并对基于权重的 QSAR 多参数优化函数进行优化来实现这一点。我们通过设计细胞周期蛋白依赖性激酶 2(CDK2)的有效抑制剂来检验这种组合方法的实用性,并展示了一种耦合工作流程,该流程可以:(1)提供 6.4 倍的富集改进,从而能够从随机选择中识别出<10 nM 的化合物,从我们之前的方法中识别出<10 nM 的化合物的富集度提高了 1.5 倍;(2)快速探索商业试剂范围之外的相关化学空间;(3)使用生成式机器学习方法从大规模的计算枚举中“学习”SAR,并生成针对灵活受体的新型有效化合物分子,这些分子在具有相关物理化学空间的同时也具有活性;(4)生成超过 300 万个构思分子并运行 1935 次 FEP 模拟,识别出 69 个预测 IC <10 nM 的点子,358 个预测 IC <100 nM 的点子。报告的数据表明,结合基于反应的和生成式机器学习进行构思,可以提高有效化合物的富集度,超过之前描述的方法,并且有可能在预定义的效力和性质空间内快速加速新型化学物质的发现。

相似文献

1
Combining Cloud-Based Free-Energy Calculations, Synthetically Aware Enumerations, and Goal-Directed Generative Machine Learning for Rapid Large-Scale Chemical Exploration and Optimization.结合基于云的自由能计算、综合感知枚举和目标导向的生成式机器学习,实现快速大规模化学探索和优化。
J Chem Inf Model. 2020 Sep 28;60(9):4311-4325. doi: 10.1021/acs.jcim.0c00120. Epub 2020 Jun 19.
2
Reaction-Based Enumeration, Active Learning, and Free Energy Calculations To Rapidly Explore Synthetically Tractable Chemical Space and Optimize Potency of Cyclin-Dependent Kinase 2 Inhibitors.基于反应的枚举、主动学习和自由能计算,快速探索合成上可处理的化学空间并优化细胞周期蛋白依赖性激酶 2 抑制剂的效力。
J Chem Inf Model. 2019 Sep 23;59(9):3782-3793. doi: 10.1021/acs.jcim.9b00367. Epub 2019 Aug 22.
3
AutoDesigner, a Design Algorithm for Rapidly Exploring Large Chemical Space for Lead Optimization: Application to the Design and Synthesis of d-Amino Acid Oxidase Inhibitors.自动设计器,一种用于快速探索大型化学空间以进行先导优化的设计算法:在d-氨基酸氧化酶抑制剂设计与合成中的应用。
J Chem Inf Model. 2022 Apr 25;62(8):1905-1915. doi: 10.1021/acs.jcim.2c00072. Epub 2022 Apr 13.
4
Trends in Deep Learning for Property-driven Drug Design.基于属性的药物设计的深度学习趋势。
Curr Med Chem. 2021;28(38):7862-7886. doi: 10.2174/0929867328666210729115728.
5
Interpretable Machine Learning Models for Molecular Design of Tyrosine Kinase Inhibitors Using Variational Autoencoders and Perturbation-Based Approach of Chemical Space Exploration.基于变分自动编码器和基于扰动的化学空间探索方法的酪氨酸激酶抑制剂分子设计可解释机器学习模型。
Int J Mol Sci. 2022 Sep 24;23(19):11262. doi: 10.3390/ijms231911262.
6
Machine Learning Guided AQFEP: A Fast and Efficient Absolute Free Energy Perturbation Solution for Virtual Screening.机器学习引导的AQFEP:一种用于虚拟筛选的快速高效的绝对自由能微扰解决方案。
J Chem Theory Comput. 2024 Aug 15;20(16):7188-98. doi: 10.1021/acs.jctc.4c00399.
7
AutoDesigner - Core Design, a De Novo Design Algorithm for Chemical Scaffolds: Application to the Design and Synthesis of Novel Selective Wee1 Inhibitors.AutoDesigner - 核心设计,一种化学支架的从头设计算法:在新型选择性 Wee1 抑制剂的设计和合成中的应用。
J Chem Inf Model. 2024 Oct 14;64(19):7513-7524. doi: 10.1021/acs.jcim.4c01031. Epub 2024 Oct 3.
8
Actively Searching: Inverse Design of Novel Molecules with Simultaneously Optimized Properties.主动搜索:同时优化性质的新型分子的逆向设计。
J Phys Chem A. 2022 Jan 20;126(2):333-340. doi: 10.1021/acs.jpca.1c08191. Epub 2022 Jan 5.
9
A Cloud Computing Platform for Scalable Relative and Absolute Binding Free Energy Predictions: New Opportunities and Challenges for Drug Discovery.用于可扩展相对和绝对结合自由能预测的云计算平台:药物发现的新机遇和挑战。
J Chem Inf Model. 2021 Jun 28;61(6):2720-2732. doi: 10.1021/acs.jcim.0c01329. Epub 2021 Jun 4.
10
V-Dock: Fast Generation of Novel Drug-like Molecules Using Machine-Learning-Based Docking Score and Molecular Optimization.V-Dock:基于机器学习的对接评分和分子优化快速生成新型类药物分子。
Int J Mol Sci. 2021 Oct 27;22(21):11635. doi: 10.3390/ijms222111635.

引用本文的文献

1
Harnessing free energy calculations for kinome-wide selectivity in drug discovery campaigns with a Wee1 case study.通过Wee1案例研究,利用自由能计算实现药物研发中全激酶组选择性研究
Nat Commun. 2025 Aug 26;16(1):7962. doi: 10.1038/s41467-025-62722-w.
2
Exhaustive local chemical space exploration using a transformer model.使用变压器模型进行详尽的局部化学空间探索。
Nat Commun. 2024 Aug 25;15(1):7315. doi: 10.1038/s41467-024-51672-4.
3
Ubiquitin-specific protease 7 inhibitors reveal a differentiated mechanism of p53-driven anti-cancer activity.
泛素特异性蛋白酶7抑制剂揭示了p53驱动的抗癌活性的差异化机制。
iScience. 2024 Apr 9;27(5):109693. doi: 10.1016/j.isci.2024.109693. eCollection 2024 May 17.
4
Benchmarking Active Learning Protocols for Ligand-Binding Affinity Prediction.基于配体结合亲和力预测的主动学习协议基准测试。
J Chem Inf Model. 2024 Mar 25;64(6):1955-1965. doi: 10.1021/acs.jcim.4c00220. Epub 2024 Mar 6.
5
Pathfinder-Driven Chemical Space Exploration and Multiparameter Optimization in Tandem with Glide/IFD and QSAR-Based Active Learning Approach to Prioritize Design Ideas for FEP+ Calculations of SARS-CoV-2 PL Inhibitors.基于 Glide/IFD 的路径驱动化学空间探索和多参数优化,以及基于 QSAR 的主动学习方法,对 SARS-CoV-2 PL 抑制剂的 FEP+计算设计理念进行优先级排序。
Molecules. 2022 Dec 5;27(23):8569. doi: 10.3390/molecules27238569.
6
Augmented Hill-Climb increases reinforcement learning efficiency for language-based de novo molecule generation.增强爬山算法提高了基于语言的从头分子生成的强化学习效率。
J Cheminform. 2022 Oct 3;14(1):68. doi: 10.1186/s13321-022-00646-z.
7
On the Value of Using 3D Shape and Electrostatic Similarities in Deep Generative Methods.利用 3D 形状和静电相似性在深度生成方法中的价值。
J Chem Inf Model. 2022 Mar 28;62(6):1388-1398. doi: 10.1021/acs.jcim.1c01535. Epub 2022 Mar 10.
8
Design of Organic Electronic Materials With a Goal-Directed Generative Model Powered by Deep Neural Networks and High-Throughput Molecular Simulations.基于深度神经网络和高通量分子模拟驱动的目标导向生成模型的有机电子材料设计
Front Chem. 2022 Jan 17;9:800370. doi: 10.3389/fchem.2021.800370. eCollection 2021.
9
Efficient Hit-to-Lead Searching of Kinase Inhibitor Chemical Space via Computational Fragment Merging.通过计算片段融合进行激酶抑制剂化学空间的高效从头至先导化合物搜索。
J Chem Inf Model. 2021 Dec 27;61(12):5967-5987. doi: 10.1021/acs.jcim.1c00630. Epub 2021 Nov 11.
10
Accelerating high-throughput virtual screening through molecular pool-based active learning.通过基于分子库的主动学习加速高通量虚拟筛选
Chem Sci. 2021 Apr 29;12(22):7866-7881. doi: 10.1039/d0sc06805e.