• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于Transformer的分子设计中强化学习的评估

Evaluation of reinforcement learning in transformer-based molecular design.

作者信息

He Jiazhen, Tibo Alessandro, Janet Jon Paul, Nittinger Eva, Tyrchan Christian, Czechtizky Werngard, Engkvist Ola

机构信息

Molecular AI, Discovery Sciences, R&D, AstraZeneca, Gothenburg, Sweden.

Medicinal Chemistry, Research and Early Development, Respiratory and Immunology (R&I), BioPharmaceuticals R&D, AstraZeneca, Gothenburg, Sweden.

出版信息

J Cheminform. 2024 Aug 8;16(1):95. doi: 10.1186/s13321-024-00887-0.

DOI:10.1186/s13321-024-00887-0
PMID:39118113
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11312936/
Abstract

Designing compounds with a range of desirable properties is a fundamental challenge in drug discovery. In pre-clinical early drug discovery, novel compounds are often designed based on an already existing promising starting compound through structural modifications for further property optimization. Recently, transformer-based deep learning models have been explored for the task of molecular optimization by training on pairs of similar molecules. This provides a starting point for generating similar molecules to a given input molecule, but has limited flexibility regarding user-defined property profiles. Here, we evaluate the effect of reinforcement learning on transformer-based molecular generative models. The generative model can be considered as a pre-trained model with knowledge of the chemical space close to an input compound, while reinforcement learning can be viewed as a tuning phase, steering the model towards chemical space with user-specific desirable properties. The evaluation of two distinct tasks-molecular optimization and scaffold discovery-suggest that reinforcement learning could guide the transformer-based generative model towards the generation of more compounds of interest. Additionally, the impact of pre-trained models, learning steps and learning rates are investigated.Scientific contributionOur study investigates the effect of reinforcement learning on a transformer-based generative model initially trained for generating molecules similar to starting molecules. The reinforcement learning framework is applied to facilitate multiparameter optimisation of starting molecules. This approach allows for more flexibility for optimizing user-specific property profiles and helps finding more ideas of interest.

摘要

设计具有一系列理想特性的化合物是药物研发中的一项基本挑战。在临床前早期药物研发中,新型化合物通常是基于已有的有前景的起始化合物,通过结构修饰来进一步优化性质而设计的。最近,基于Transformer的深度学习模型已被探索用于通过对相似分子对进行训练来完成分子优化任务。这为生成与给定输入分子相似的分子提供了一个起点,但在用户定义的性质概况方面灵活性有限。在此,我们评估强化学习对基于Transformer的分子生成模型的影响。生成模型可被视为一个预训练模型,它了解靠近输入化合物的化学空间,而强化学习可被视为一个调优阶段,引导模型朝着具有用户特定理想性质的化学空间发展。对两个不同任务——分子优化和骨架发现——的评估表明,强化学习可以引导基于Transformer的生成模型生成更多感兴趣的化合物。此外,还研究了预训练模型、学习步骤和学习率的影响。

科学贡献

我们的研究调查了强化学习对最初为生成与起始分子相似的分子而训练的基于Transformer的生成模型的影响。应用强化学习框架以促进起始分子的多参数优化。这种方法在优化用户特定性质概况方面具有更大的灵活性,并有助于找到更多感兴趣的思路。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/759b/11312936/500330c42524/13321_2024_887_Fig19_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/759b/11312936/bb09f481476e/13321_2024_887_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/759b/11312936/bb67025fa4a7/13321_2024_887_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/759b/11312936/9f7172c71e93/13321_2024_887_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/759b/11312936/2f322a7ff53b/13321_2024_887_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/759b/11312936/4058c82e0334/13321_2024_887_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/759b/11312936/168e9eb36b5b/13321_2024_887_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/759b/11312936/489c0eb128f9/13321_2024_887_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/759b/11312936/910fb030c3c3/13321_2024_887_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/759b/11312936/2d0054711d28/13321_2024_887_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/759b/11312936/1e469abc7786/13321_2024_887_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/759b/11312936/879850bbefe6/13321_2024_887_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/759b/11312936/2e7ff2861456/13321_2024_887_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/759b/11312936/11ec9c35e5f2/13321_2024_887_Fig13_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/759b/11312936/0009e729c715/13321_2024_887_Fig14_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/759b/11312936/38b1c22ecf8b/13321_2024_887_Fig15_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/759b/11312936/0e9ef9781e3e/13321_2024_887_Fig16_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/759b/11312936/ffd5e219349d/13321_2024_887_Fig17_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/759b/11312936/81da783ecf70/13321_2024_887_Fig18_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/759b/11312936/500330c42524/13321_2024_887_Fig19_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/759b/11312936/bb09f481476e/13321_2024_887_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/759b/11312936/bb67025fa4a7/13321_2024_887_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/759b/11312936/9f7172c71e93/13321_2024_887_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/759b/11312936/2f322a7ff53b/13321_2024_887_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/759b/11312936/4058c82e0334/13321_2024_887_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/759b/11312936/168e9eb36b5b/13321_2024_887_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/759b/11312936/489c0eb128f9/13321_2024_887_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/759b/11312936/910fb030c3c3/13321_2024_887_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/759b/11312936/2d0054711d28/13321_2024_887_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/759b/11312936/1e469abc7786/13321_2024_887_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/759b/11312936/879850bbefe6/13321_2024_887_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/759b/11312936/2e7ff2861456/13321_2024_887_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/759b/11312936/11ec9c35e5f2/13321_2024_887_Fig13_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/759b/11312936/0009e729c715/13321_2024_887_Fig14_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/759b/11312936/38b1c22ecf8b/13321_2024_887_Fig15_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/759b/11312936/0e9ef9781e3e/13321_2024_887_Fig16_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/759b/11312936/ffd5e219349d/13321_2024_887_Fig17_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/759b/11312936/81da783ecf70/13321_2024_887_Fig18_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/759b/11312936/500330c42524/13321_2024_887_Fig19_HTML.jpg

相似文献

1
Evaluation of reinforcement learning in transformer-based molecular design.基于Transformer的分子设计中强化学习的评估
J Cheminform. 2024 Aug 8;16(1):95. doi: 10.1186/s13321-024-00887-0.
2
Optimization of binding affinities in chemical space with generative pre-trained transformer and deep reinforcement learning.利用生成式预训练变换器和深度强化学习在化学空间中优化结合亲和力
F1000Res. 2024 Feb 20;12:757. doi: 10.12688/f1000research.130936.2. eCollection 2023.
3
DrugEx v3: scaffold-constrained drug design with graph transformer-based reinforcement learning.DrugEx v3:基于图变换器强化学习的支架约束药物设计
J Cheminform. 2023 Feb 20;15(1):24. doi: 10.1186/s13321-023-00694-z.
4
Transformer-based molecular optimization beyond matched molecular pairs.超越匹配分子对的基于Transformer的分子优化。
J Cheminform. 2022 Mar 28;14(1):18. doi: 10.1186/s13321-022-00599-3.
5
cMolGPT: A Conditional Generative Pre-Trained Transformer for Target-Specific De Novo Molecular Generation.cMolGPT:一种用于靶向特定从头分子生成的条件生成式预训练转换器。
Molecules. 2023 May 30;28(11):4430. doi: 10.3390/molecules28114430.
6
AB-Gen: Antibody Library Design with Generative Pre-trained Transformer and Deep Reinforcement Learning.AB-Gen:基于生成式预训练变换器和深度强化学习的抗体库设计
Genomics Proteomics Bioinformatics. 2023 Oct;21(5):1043-1053. doi: 10.1016/j.gpb.2023.03.004. Epub 2023 Jun 24.
7
FSM-DDTR: End-to-end feedback strategy for multi-objective De Novo drug design using transformers.FSM-DDTR:使用变压器的多目标从头药物设计的端到端反馈策略。
Comput Biol Med. 2023 Sep;164:107285. doi: 10.1016/j.compbiomed.2023.107285. Epub 2023 Jul 31.
8
Molecular optimization by capturing chemist's intuition using deep neural networks.通过使用深度神经网络捕捉化学家的直觉进行分子优化。
J Cheminform. 2021 Mar 20;13(1):26. doi: 10.1186/s13321-021-00497-0.
9
Exhaustive local chemical space exploration using a transformer model.使用变压器模型进行详尽的局部化学空间探索。
Nat Commun. 2024 Aug 25;15(1):7315. doi: 10.1038/s41467-024-51672-4.
10
Enhancing reinforcement learning for de novo molecular design applying self-attention mechanisms.应用自注意力机制增强从头分子设计中的强化学习。
Brief Bioinform. 2023 Sep 22;24(6). doi: 10.1093/bib/bbad368.

引用本文的文献

1
Generative design of singlet fission materials leveraging a fragment-oriented database.利用面向片段的数据库进行单线态裂变材料的生成式设计。
Chem Sci. 2025 Aug 25. doi: 10.1039/d5sc03184b.
2
PepINVENT: generative peptide design beyond natural amino acids.PepINVENT:超越天然氨基酸的生成性肽设计。
Chem Sci. 2025 Apr 16;16(20):8682-8696. doi: 10.1039/d4sc07642g. eCollection 2025 May 21.
3
Optimal Molecular Design: Generative Active Learning Combining REINVENT with Precise Binding Free Energy Ranking Simulations.最优分子设计:结合REINVENT与精确结合自由能排序模拟的生成式主动学习

本文引用的文献

1
Reinvent 4: Modern AI-driven generative molecule design.重塑4:现代人工智能驱动的生成式分子设计。
J Cheminform. 2024 Feb 21;16(1):20. doi: 10.1186/s13321-024-00812-5.
2
ScaffoldGVAE: scaffold generation and hopping of drug molecules via a variational autoencoder based on multi-view graph neural networks.支架生成变分自编码器(ScaffoldGVAE):基于多视图图神经网络的变分自编码器实现药物分子的支架生成与跳跃
J Cheminform. 2023 Oct 4;15(1):91. doi: 10.1186/s13321-023-00766-0.
3
DrugEx v3: scaffold-constrained drug design with graph transformer-based reinforcement learning.
J Chem Theory Comput. 2024 Sep 3;20(18):8308-28. doi: 10.1021/acs.jctc.4c00576.
DrugEx v3:基于图变换器强化学习的支架约束药物设计
J Cheminform. 2023 Feb 20;15(1):24. doi: 10.1186/s13321-023-00694-z.
4
Transformer-based molecular optimization beyond matched molecular pairs.超越匹配分子对的基于Transformer的分子优化。
J Cheminform. 2022 Mar 28;14(1):18. doi: 10.1186/s13321-022-00599-3.
5
Deep scaffold hopping with multimodal transformer neural networks.基于多模态变压器神经网络的深度骨架跳跃
J Cheminform. 2021 Nov 13;13(1):87. doi: 10.1186/s13321-021-00565-5.
6
MolGPT: Molecular Generation Using a Transformer-Decoder Model.MolGPT:基于 Transformer-Decoder 模型的分子生成。
J Chem Inf Model. 2022 May 9;62(9):2064-2076. doi: 10.1021/acs.jcim.1c00600. Epub 2021 Oct 25.
7
LibINVENT: Reaction-based Generative Scaffold Decoration for Library Design.基于反应的生成性支架修饰库设计
J Chem Inf Model. 2022 May 9;62(9):2046-2063. doi: 10.1021/acs.jcim.1c00469. Epub 2021 Aug 30.
8
SyntaLinker: automatic fragment linking with deep conditional transformer neural networks.SyntaLinker:基于深度条件变压器神经网络的自动片段链接
Chem Sci. 2020 Jul 22;11(31):8312-8322. doi: 10.1039/d0sc03126g.
9
Scaffold-based molecular design with a graph generative model.基于支架的分子设计与图形生成模型。
Chem Sci. 2019 Dec 3;11(4):1153-1164. doi: 10.1039/c9sc04503a.
10
Masked graph modeling for molecule generation.掩蔽图建模用于分子生成。
Nat Commun. 2021 May 26;12(1):3156. doi: 10.1038/s41467-021-23415-2.