• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

PromptSMILES:在化学语言模型中促进支架修饰和片段连接。

PromptSMILES: prompting for scaffold decoration and fragment linking in chemical language models.

作者信息

Thomas Morgan, Ahmad Mazen, Tresadern Gary, de Fabritiis Gianni

机构信息

Computational Science Laboratory, Universitat Pompeu Fabra, Barcelona Biomedical Research Park (PRBB), C Dr. Aguiader 88, 08003, Barcelona, Spain.

In Silico Discovery, Janssen Pharmaceutica N. V., Turnhoutseweg 30, 2340, Beerse, Belgium.

出版信息

J Cheminform. 2024 Jul 4;16(1):77. doi: 10.1186/s13321-024-00866-5.

DOI:10.1186/s13321-024-00866-5
PMID:38965600
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11225391/
Abstract

SMILES-based generative models are amongst the most robust and successful recent methods used to augment drug design. They are typically used for complete de novo generation, however, scaffold decoration and fragment linking applications are sometimes desirable which requires a different grammar, architecture, training dataset and therefore, re-training of a new model. In this work, we describe a simple procedure to conduct constrained molecule generation with a SMILES-based generative model to extend applicability to scaffold decoration and fragment linking by providing SMILES prompts, without the need for re-training. In combination with reinforcement learning, we show that pre-trained, decoder-only models adapt to these applications quickly and can further optimize molecule generation towards a specified objective. We compare the performance of this approach to a variety of orthogonal approaches and show that performance is comparable or better. For convenience, we provide an easy-to-use python package to facilitate model sampling which can be found on GitHub and the Python Package Index.Scientific contributionThis novel method extends an autoregressive chemical language model to scaffold decoration and fragment linking scenarios. This doesn't require re-training, the use of a bespoke grammar, or curation of a custom dataset, as commonly required by other approaches.

摘要

基于SMILES的生成模型是近年来用于药物设计增强的最强大、最成功的方法之一。它们通常用于完全从头生成,然而,有时需要进行骨架修饰和片段连接应用,这需要不同的语法、架构、训练数据集,因此需要重新训练新模型。在这项工作中,我们描述了一种简单的程序,通过基于SMILES的生成模型进行受限分子生成,通过提供SMILES提示将适用性扩展到骨架修饰和片段连接,而无需重新训练。结合强化学习,我们表明预训练的、仅解码器的模型能够快速适应这些应用,并可以朝着指定目标进一步优化分子生成。我们将这种方法的性能与各种正交方法进行比较,结果表明性能相当或更好。为方便起见,我们提供了一个易于使用的Python包来促进模型采样,可在GitHub和Python Package Index上找到。

科学贡献

这种新方法将自回归化学语言模型扩展到骨架修饰和片段连接场景。这不需要像其他方法通常要求的那样重新训练、使用定制语法或策划定制数据集。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/daf3/11225391/dc072969f364/13321_2024_866_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/daf3/11225391/99ef900f97b2/13321_2024_866_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/daf3/11225391/2deb203bc261/13321_2024_866_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/daf3/11225391/5008c141dbd1/13321_2024_866_Figb_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/daf3/11225391/4846e316cc48/13321_2024_866_Figc_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/daf3/11225391/2a5456bc5b94/13321_2024_866_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/daf3/11225391/ad41c53e83e8/13321_2024_866_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/daf3/11225391/f9bfec7cdcdd/13321_2024_866_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/daf3/11225391/eb367c047863/13321_2024_866_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/daf3/11225391/dc072969f364/13321_2024_866_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/daf3/11225391/99ef900f97b2/13321_2024_866_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/daf3/11225391/2deb203bc261/13321_2024_866_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/daf3/11225391/5008c141dbd1/13321_2024_866_Figb_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/daf3/11225391/4846e316cc48/13321_2024_866_Figc_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/daf3/11225391/2a5456bc5b94/13321_2024_866_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/daf3/11225391/ad41c53e83e8/13321_2024_866_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/daf3/11225391/f9bfec7cdcdd/13321_2024_866_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/daf3/11225391/eb367c047863/13321_2024_866_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/daf3/11225391/dc072969f364/13321_2024_866_Fig6_HTML.jpg

相似文献

1
PromptSMILES: prompting for scaffold decoration and fragment linking in chemical language models.PromptSMILES:在化学语言模型中促进支架修饰和片段连接。
J Cheminform. 2024 Jul 4;16(1):77. doi: 10.1186/s13321-024-00866-5.
2
SMILES-based deep generative scaffold decorator for de-novo drug design.用于从头药物设计的基于SMILES的深度生成支架修饰器。
J Cheminform. 2020 May 29;12(1):38. doi: 10.1186/s13321-020-00441-8.
3
Automated Generation of Novel Fragments Using Screening Data, a Dual SMILES Autoencoder, Transfer Learning and Syntax Correction.利用筛选数据、双 SMILES 自动编码器、迁移学习和语法修正自动化生成新片段。
J Chem Inf Model. 2021 Jun 28;61(6):2547-2559. doi: 10.1021/acs.jcim.0c01226. Epub 2021 May 24.
4
MolScore: a scoring, evaluation and benchmarking framework for generative models in de novo drug design.MolScore:一种用于从头药物设计中生成模型的评分、评估和基准测试框架。
J Cheminform. 2024 May 30;16(1):64. doi: 10.1186/s13321-024-00861-w.
5
Unlocking comprehensive molecular design across all scenarios with large language model and unordered chemical language.利用大语言模型和无序化学语言解锁所有场景下的全面分子设计。
Chem Sci. 2024 Jul 29;15(34):13727-13740. doi: 10.1039/d4sc03744h. eCollection 2024 Aug 28.
6
Augmented Hill-Climb increases reinforcement learning efficiency for language-based de novo molecule generation.增强爬山算法提高了基于语言的从头分子生成的强化学习效率。
J Cheminform. 2022 Oct 3;14(1):68. doi: 10.1186/s13321-022-00646-z.
7
Scaffold-Constrained Molecular Generation.支架约束分子生成。
J Chem Inf Model. 2020 Dec 28;60(12):5637-5646. doi: 10.1021/acs.jcim.0c01015. Epub 2020 Dec 10.
8
Tree-Invent: A Novel Multipurpose Molecular Generative Model Constrained with a Topological Tree.树创:一种新型的基于拓扑树约束的多功能分子生成模型。
J Chem Inf Model. 2023 Nov 27;63(22):7067-7082. doi: 10.1021/acs.jcim.3c01626. Epub 2023 Nov 14.
9
DrugEx v3: scaffold-constrained drug design with graph transformer-based reinforcement learning.DrugEx v3:基于图变换器强化学习的支架约束药物设计
J Cheminform. 2023 Feb 20;15(1):24. doi: 10.1186/s13321-023-00694-z.
10
Generative Adversarial Networks for De Novo Molecular Design.生成对抗网络用于从头分子设计。
Mol Inform. 2021 Oct;40(10):e2100045. doi: 10.1002/minf.202100045. Epub 2021 Jul 6.

引用本文的文献

1
Machine Learning for Toxicity Prediction Using Chemical Structures: Pillars for Success in the Real World.利用化学结构进行毒性预测的机器学习:在现实世界中取得成功的支柱。
Chem Res Toxicol. 2025 May 19;38(5):759-807. doi: 10.1021/acs.chemrestox.5c00033. Epub 2025 May 2.
2
ACEGEN: Reinforcement Learning of Generative Chemical Agents for Drug Discovery.ACEGEN:用于药物发现的生成式化学试剂的强化学习。
J Chem Inf Model. 2024 Aug 12;64(15):5900-5911. doi: 10.1021/acs.jcim.4c00895. Epub 2024 Aug 2.

本文引用的文献

1
Reinvent 4: Modern AI-driven generative molecule design.重塑4:现代人工智能驱动的生成式分子设计。
J Cheminform. 2024 Feb 21;16(1):20. doi: 10.1186/s13321-024-00812-5.
2
Faster and more diverse de novo molecular optimization with double-loop reinforcement learning using augmented SMILES.使用增强型 SMILES 进行双环强化学习,实现更快、更多样的从头分子优化。
J Comput Aided Mol Des. 2023 Aug;37(8):373-394. doi: 10.1007/s10822-023-00512-6. Epub 2023 Jun 17.
3
Generative and reinforcement learning approaches for the automated de novo design of bioactive compounds.
用于生物活性化合物自动从头设计的生成式和强化学习方法。
Commun Chem. 2022 Oct 18;5(1):129. doi: 10.1038/s42004-022-00733-0.
4
Leveraging molecular structure and bioactivity with chemical language models for de novo drug design.利用分子结构和生物活性与化学语言模型进行从头药物设计。
Nat Commun. 2023 Jan 7;14(1):114. doi: 10.1038/s41467-022-35692-6.
5
Drug Design Using Reinforcement Learning with Graph-Based Deep Generative Models.基于图的深度生成模型的强化学习药物设计。
J Chem Inf Model. 2022 Oct 24;62(20):4863-4872. doi: 10.1021/acs.jcim.2c00838. Epub 2022 Oct 11.
6
Augmented Hill-Climb increases reinforcement learning efficiency for language-based de novo molecule generation.增强爬山算法提高了基于语言的从头分子生成的强化学习效率。
J Cheminform. 2022 Oct 3;14(1):68. doi: 10.1186/s13321-022-00646-z.
7
Effective Reaction-Based Strategy for Kinase Targets: A Case Study on MERTK Inhibitors.基于反应的有效激酶靶点策略:以 MERTK 抑制剂为例。
J Chem Inf Model. 2022 Apr 11;62(7):1654-1668. doi: 10.1021/acs.jcim.2c00068. Epub 2022 Mar 30.
8
Generative machine learning for de novo drug discovery: A systematic review.生成式机器学习在从头药物发现中的应用:系统评价。
Comput Biol Med. 2022 Jun;145:105403. doi: 10.1016/j.compbiomed.2022.105403. Epub 2022 Mar 13.
9
LibINVENT: Reaction-based Generative Scaffold Decoration for Library Design.基于反应的生成性支架修饰库设计
J Chem Inf Model. 2022 May 9;62(9):2046-2063. doi: 10.1021/acs.jcim.1c00469. Epub 2021 Aug 30.
10
SyntaLinker: automatic fragment linking with deep conditional transformer neural networks.SyntaLinker:基于深度条件变压器神经网络的自动片段链接
Chem Sci. 2020 Jul 22;11(31):8312-8322. doi: 10.1039/d0sc03126g.