• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

解释并避免小分子目标导向生成中的失败模式。

Explaining and avoiding failure modes in goal-directed generation of small molecules.

作者信息

Langevin Maxime, Vuilleumier Rodolphe, Bianciotto Marc

机构信息

Molecular Design Sciences - Integrated Drug Discovery, Sanofi R&D, 94400, Vitry-sur-Seine, France.

PASTEUR, Département de chimie, École Normale Supérieure, PSL University, Sorbonne Université, CNRS, 75005, Paris, France.

出版信息

J Cheminform. 2022 Apr 1;14(1):20. doi: 10.1186/s13321-022-00601-y.

DOI:10.1186/s13321-022-00601-y
PMID:35365218
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8973583/
Abstract

Despite growing interest and success in automated in-silico molecular design, questions remain regarding the ability of goal-directed generation algorithms to perform unbiased exploration of novel chemical spaces. A specific phenomenon has recently been highlighted: goal-directed generation guided with machine learning models produce molecules with high scores according to the optimization model, but low scores according to control models, even when trained on the same data distribution and the same target. In this work, we show that this worrisome behavior is actually due to issues with the predictive models and not the goal-directed generation algorithms. We show that with appropriate predictive models, this issue can be resolved, and molecules generated have high scores according to both the optimization and the control models.

摘要

尽管在自动化计算机辅助分子设计方面的兴趣与成果不断增加,但关于目标导向生成算法对新型化学空间进行无偏差探索的能力仍存在问题。最近凸显了一个特定现象:即使在相同的数据分布和相同目标上进行训练,由机器学习模型引导的目标导向生成会产生根据优化模型得分高但根据控制模型得分低的分子。在这项工作中,我们表明这种令人担忧的行为实际上是由于预测模型的问题而非目标导向生成算法。我们表明,使用适当的预测模型,这个问题可以得到解决,并且生成的分子根据优化模型和控制模型得分都很高。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5436/8973583/ace05538b21d/13321_2022_601_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5436/8973583/fb28410b235d/13321_2022_601_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5436/8973583/206d19eefe4e/13321_2022_601_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5436/8973583/d44478dbe9d9/13321_2022_601_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5436/8973583/a45cce98fcb7/13321_2022_601_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5436/8973583/cc949fb23d46/13321_2022_601_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5436/8973583/7fee872214a5/13321_2022_601_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5436/8973583/4c2342eb8492/13321_2022_601_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5436/8973583/ace05538b21d/13321_2022_601_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5436/8973583/fb28410b235d/13321_2022_601_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5436/8973583/206d19eefe4e/13321_2022_601_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5436/8973583/d44478dbe9d9/13321_2022_601_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5436/8973583/a45cce98fcb7/13321_2022_601_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5436/8973583/cc949fb23d46/13321_2022_601_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5436/8973583/7fee872214a5/13321_2022_601_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5436/8973583/4c2342eb8492/13321_2022_601_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5436/8973583/ace05538b21d/13321_2022_601_Fig8_HTML.jpg

相似文献

1
Explaining and avoiding failure modes in goal-directed generation of small molecules.解释并避免小分子目标导向生成中的失败模式。
J Cheminform. 2022 Apr 1;14(1):20. doi: 10.1186/s13321-022-00601-y.
2
Molecular Assays Simulator to Unravel Predictors Hacking in Goal-Directed Molecular Generations.分子分析模拟器,揭示定向分子生成中的黑客预测因子。
J Chem Inf Model. 2023 Jul 10;63(13):3983-3998. doi: 10.1021/acs.jcim.3c00195. Epub 2023 Jun 22.
3
Optimization of binding affinities in chemical space with generative pre-trained transformer and deep reinforcement learning.利用生成式预训练变换器和深度强化学习在化学空间中优化结合亲和力
F1000Res. 2024 Feb 20;12:757. doi: 10.12688/f1000research.130936.2. eCollection 2023.
4
Scaffold-Constrained Molecular Generation.支架约束分子生成。
J Chem Inf Model. 2020 Dec 28;60(12):5637-5646. doi: 10.1021/acs.jcim.0c01015. Epub 2020 Dec 10.
5
Comprehensive assessment of deep generative architectures for de novo drug design.从头设计药物的深度生成式架构的综合评估。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab544.
6
Generative and reinforcement learning approaches for the automated de novo design of bioactive compounds.用于生物活性化合物自动从头设计的生成式和强化学习方法。
Commun Chem. 2022 Oct 18;5(1):129. doi: 10.1038/s42004-022-00733-0.
7
Combining Cloud-Based Free-Energy Calculations, Synthetically Aware Enumerations, and Goal-Directed Generative Machine Learning for Rapid Large-Scale Chemical Exploration and Optimization.结合基于云的自由能计算、综合感知枚举和目标导向的生成式机器学习,实现快速大规模化学探索和优化。
J Chem Inf Model. 2020 Sep 28;60(9):4311-4325. doi: 10.1021/acs.jcim.0c00120. Epub 2020 Jun 19.
8
De novo generation of dual-target ligands using adversarial training and reinforcement learning.使用对抗训练和强化学习生成双靶配体。
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab333.
9
Applications of Deep Learning in Molecule Generation and Molecular Property Prediction.深度学习在分子生成和分子性质预测中的应用。
Acc Chem Res. 2021 Jan 19;54(2):263-270. doi: 10.1021/acs.accounts.0c00699. Epub 2020 Dec 28.
10
DrugEx v2: de novo design of drug molecules by Pareto-based multi-objective reinforcement learning in polypharmacology.DrugEx v2:基于帕累托的多目标强化学习在多药理学中从头设计药物分子
J Cheminform. 2021 Nov 12;13(1):85. doi: 10.1186/s13321-021-00561-9.

引用本文的文献

1
Boltz-2: Towards Accurate and Efficient Binding Affinity Prediction.Boltz-2:迈向准确高效的结合亲和力预测
bioRxiv. 2025 Jun 18:2025.06.14.659707. doi: 10.1101/2025.06.14.659707.
2
Generative Deep Learning for de Novo Drug Design─A Chemical Space Odyssey.用于从头药物设计的生成式深度学习——一场化学空间奥德赛。
J Chem Inf Model. 2025 Jul 28;65(14):7352-7372. doi: 10.1021/acs.jcim.5c00641. Epub 2025 Jul 9.
3
A data-driven generative strategy to avoid reward hacking in multi-objective molecular design.一种数据驱动的生成策略,用于避免多目标分子设计中的奖励操纵。

本文引用的文献

1
Comparison of structure- and ligand-based scoring functions for deep generative models: a GPCR case study.深度生成模型中基于结构和配体的评分函数比较:以G蛋白偶联受体为例的研究
J Cheminform. 2021 May 13;13(1):39. doi: 10.1186/s13321-021-00516-0.
2
Critical assessment of AI in drug discovery.人工智能在药物发现中的关键性评估。
Expert Opin Drug Discov. 2021 Sep;16(9):937-947. doi: 10.1080/17460441.2021.1915982. Epub 2021 Apr 19.
3
Could graph neural networks learn better molecular representation for drug discovery? A comparison study of descriptor-based and graph-based models.
Nat Commun. 2025 Mar 11;16(1):2409. doi: 10.1038/s41467-025-57582-3.
4
Human-in-the-loop active learning for goal-oriented molecule generation.用于目标导向分子生成的人工参与式主动学习
J Cheminform. 2024 Dec 9;16(1):138. doi: 10.1186/s13321-024-00924-y.
5
ACEGEN: Reinforcement Learning of Generative Chemical Agents for Drug Discovery.ACEGEN:用于药物发现的生成式化学试剂的强化学习。
J Chem Inf Model. 2024 Aug 12;64(15):5900-5911. doi: 10.1021/acs.jcim.4c00895. Epub 2024 Aug 2.
6
MolScore: a scoring, evaluation and benchmarking framework for generative models in de novo drug design.MolScore:一种用于从头药物设计中生成模型的评分、评估和基准测试框架。
J Cheminform. 2024 May 30;16(1):64. doi: 10.1186/s13321-024-00861-w.
7
Development of scoring-assisted generative exploration (SAGE) and its application to dual inhibitor design for acetylcholinesterase and monoamine oxidase B.评分辅助生成性探索(SAGE)的开发及其在乙酰胆碱酯酶和单胺氧化酶B双重抑制剂设计中的应用。
J Cheminform. 2024 May 24;16(1):59. doi: 10.1186/s13321-024-00845-w.
8
Augmented Hill-Climb increases reinforcement learning efficiency for language-based de novo molecule generation.增强爬山算法提高了基于语言的从头分子生成的强化学习效率。
J Cheminform. 2022 Oct 3;14(1):68. doi: 10.1186/s13321-022-00646-z.
图神经网络能否为药物发现学习更好的分子表示?基于描述符和基于图的模型的比较研究。
J Cheminform. 2021 Feb 17;13(1):12. doi: 10.1186/s13321-020-00479-8.
4
On failure modes in molecule generation and optimization.关于分子生成和优化中的失效模式。
Drug Discov Today Technol. 2019 Dec;32-33:55-63. doi: 10.1016/j.ddtec.2020.09.003. Epub 2020 Oct 24.
5
LIT-PCBA: An Unbiased Data Set for Machine Learning and Virtual Screening.LIT-PCBA:用于机器学习和虚拟筛选的无偏数据集。
J Chem Inf Model. 2020 Sep 28;60(9):4263-4273. doi: 10.1021/acs.jcim.0c00155. Epub 2020 Apr 23.
6
The Surprising Creativity of Digital Evolution: A Collection of Anecdotes from the Evolutionary Computation and Artificial Life Research Communities.数字进化的惊人创造力:进化计算和人工生命研究社区的轶事集。
Artif Life. 2020 Spring;26(2):274-306. doi: 10.1162/artl_a_00319. Epub 2020 Apr 9.
7
The Synthesizability of Molecules Proposed by Generative Models.生成式模型提出的分子可合成性。
J Chem Inf Model. 2020 Dec 28;60(12):5714-5723. doi: 10.1021/acs.jcim.0c00174. Epub 2020 Apr 17.
8
Efficient multi-objective molecular optimization in a continuous latent space.连续潜在空间中的高效多目标分子优化。
Chem Sci. 2019 Jul 8;10(34):8016-8024. doi: 10.1039/c9sc01928f. eCollection 2019 Sep 14.
9
A graph-based genetic algorithm and generative model/Monte Carlo tree search for the exploration of chemical space.一种基于图的遗传算法和生成模型/蒙特卡罗树搜索方法用于化学空间探索。
Chem Sci. 2019 Feb 11;10(12):3567-3572. doi: 10.1039/c8sc05372c. eCollection 2019 Mar 28.
10
GuacaMol: Benchmarking Models for de Novo Molecular Design.GuacaMol:从头设计分子的模型基准测试。
J Chem Inf Model. 2019 Mar 25;59(3):1096-1108. doi: 10.1021/acs.jcim.8b00839. Epub 2019 Mar 19.