• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于靶向分子生成的面向多样性的深度强化学习

Diversity oriented Deep Reinforcement Learning for targeted molecule generation.

作者信息

Pereira Tiago, Abbasi Maryam, Ribeiro Bernardete, Arrais Joel P

机构信息

Department of Informatics Engineering, Centre for Informatics and Systems of the University of Coimbra, University of Coimbra, Pinhal de Marrocos, Coimbra, Portugal.

出版信息

J Cheminform. 2021 Mar 9;13(1):21. doi: 10.1186/s13321-021-00498-z.

DOI:10.1186/s13321-021-00498-z
PMID:33750461
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7944916/
Abstract

In this work, we explore the potential of deep learning to streamline the process of identifying new potential drugs through the computational generation of molecules with interesting biological properties. Two deep neural networks compose our targeted generation framework: the Generator, which is trained to learn the building rules of valid molecules employing SMILES strings notation, and the Predictor which evaluates the newly generated compounds by predicting their affinity for the desired target. Then, the Generator is optimized through Reinforcement Learning to produce molecules with bespoken properties. The innovation of this approach is the exploratory strategy applied during the reinforcement training process that seeks to add novelty to the generated compounds. This training strategy employs two Generators interchangeably to sample new SMILES: the initially trained model that will remain fixed and a copy of the previous one that will be updated during the training to uncover the most promising molecules. The evolution of the reward assigned by the Predictor determines how often each one is employed to select the next token of the molecule. This strategy establishes a compromise between the need to acquire more information about the chemical space and the need to sample new molecules, with the experience gained so far. To demonstrate the effectiveness of the method, the Generator is trained to design molecules with an optimized coefficient of partition and also high inhibitory power against the Adenosine [Formula: see text] and [Formula: see text] opioid receptors. The results reveal that the model can effectively adjust the newly generated molecules towards the wanted direction. More importantly, it was possible to find promising sets of unique and diverse molecules, which was the main purpose of the newly implemented strategy.

摘要

在这项工作中,我们探索了深度学习的潜力,通过计算生成具有有趣生物学特性的分子,来简化识别新潜在药物的过程。我们的目标生成框架由两个深度神经网络组成:生成器,它经过训练以学习使用SMILES字符串表示法的有效分子的构建规则;预测器,它通过预测新生成化合物对所需靶点的亲和力来评估这些化合物。然后,通过强化学习对生成器进行优化,以生成具有特定性质的分子。这种方法的创新之处在于在强化训练过程中应用的探索策略,该策略旨在为生成的化合物增添新颖性。这种训练策略交替使用两个生成器来采样新的SMILES:初始训练的模型将保持不变,以及前一个模型的副本,该副本将在训练过程中更新,以发现最有前景的分子。预测器分配的奖励的演变决定了每个生成器用于选择分子的下一个令牌的频率。这种策略在获取更多关于化学空间信息的需求与采样新分子的需求之间,以及与迄今为止获得的经验之间达成了妥协。为了证明该方法的有效性,对生成器进行训练,以设计具有优化分配系数且对腺苷[公式:见正文]和[公式:见正文]阿片受体具有高抑制能力的分子。结果表明,该模型可以有效地将新生成的分子朝着期望的方向调整。更重要的是,有可能找到一组有前景的独特且多样的分子,这是新实施策略的主要目的。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4aee/7944916/21d10f6a8902/13321_2021_498_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4aee/7944916/c1af330193a1/13321_2021_498_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4aee/7944916/c8acbb4d533d/13321_2021_498_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4aee/7944916/14de493f98be/13321_2021_498_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4aee/7944916/5ad1114497c6/13321_2021_498_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4aee/7944916/d3ea3b69c79f/13321_2021_498_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4aee/7944916/adb11f347557/13321_2021_498_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4aee/7944916/13ecaadb0f82/13321_2021_498_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4aee/7944916/9f3b914f2186/13321_2021_498_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4aee/7944916/8d741a4fb1fa/13321_2021_498_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4aee/7944916/21d10f6a8902/13321_2021_498_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4aee/7944916/c1af330193a1/13321_2021_498_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4aee/7944916/c8acbb4d533d/13321_2021_498_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4aee/7944916/14de493f98be/13321_2021_498_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4aee/7944916/5ad1114497c6/13321_2021_498_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4aee/7944916/d3ea3b69c79f/13321_2021_498_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4aee/7944916/adb11f347557/13321_2021_498_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4aee/7944916/13ecaadb0f82/13321_2021_498_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4aee/7944916/9f3b914f2186/13321_2021_498_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4aee/7944916/8d741a4fb1fa/13321_2021_498_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4aee/7944916/21d10f6a8902/13321_2021_498_Fig10_HTML.jpg

相似文献

1
Diversity oriented Deep Reinforcement Learning for targeted molecule generation.用于靶向分子生成的面向多样性的深度强化学习
J Cheminform. 2021 Mar 9;13(1):21. doi: 10.1186/s13321-021-00498-z.
2
DrugEx v2: de novo design of drug molecules by Pareto-based multi-objective reinforcement learning in polypharmacology.DrugEx v2:基于帕累托的多目标强化学习在多药理学中从头设计药物分子
J Cheminform. 2021 Nov 12;13(1):85. doi: 10.1186/s13321-021-00561-9.
3
Optimizing blood-brain barrier permeation through deep reinforcement learning for de novo drug design.通过深度强化学习优化血脑屏障通透性,用于从头设计药物。
Bioinformatics. 2021 Jul 12;37(Suppl_1):i84-i92. doi: 10.1093/bioinformatics/btab301.
4
FSM-DDTR: End-to-end feedback strategy for multi-objective De Novo drug design using transformers.FSM-DDTR:使用变压器的多目标从头药物设计的端到端反馈策略。
Comput Biol Med. 2023 Sep;164:107285. doi: 10.1016/j.compbiomed.2023.107285. Epub 2023 Jul 31.
5
An exploration strategy improves the diversity of de novo ligands using deep reinforcement learning: a case for the adenosine A receptor.一种探索策略利用深度强化学习提高从头设计配体的多样性:以腺苷 A 受体为例。
J Cheminform. 2019 May 24;11(1):35. doi: 10.1186/s13321-019-0355-6.
6
Deep reinforcement learning for de novo drug design.基于深度强化学习的从头药物设计。
Sci Adv. 2018 Jul 25;4(7):eaap7885. doi: 10.1126/sciadv.aap7885. eCollection 2018 Jul.
7
Designing optimized drug candidates with Generative Adversarial Network.利用生成对抗网络设计优化的候选药物。
J Cheminform. 2022 Jun 26;14(1):40. doi: 10.1186/s13321-022-00623-6.
8
Faster and more diverse de novo molecular optimization with double-loop reinforcement learning using augmented SMILES.使用增强型 SMILES 进行双环强化学习,实现更快、更多样的从头分子优化。
J Comput Aided Mol Des. 2023 Aug;37(8):373-394. doi: 10.1007/s10822-023-00512-6. Epub 2023 Jun 17.
9
UnCorrupt SMILES: a novel approach to de novo design.未腐败的SMILES:一种全新的从头设计方法。
J Cheminform. 2023 Feb 14;15(1):22. doi: 10.1186/s13321-023-00696-x.
10
Molecule generation using transformers and policy gradient reinforcement learning.基于 Transformer 和策略梯度强化学习的分子生成。
Sci Rep. 2023 May 31;13(1):8799. doi: 10.1038/s41598-023-35648-w.

引用本文的文献

1
Optimizing blood-brain barrier permeability in KRAS inhibitors: A structure-constrained molecular generation approach.优化KRAS抑制剂的血脑屏障通透性:一种结构受限的分子生成方法。
J Pharm Anal. 2025 Aug;15(8):101337. doi: 10.1016/j.jpha.2025.101337. Epub 2025 May 9.
2
A genotype-to-drug diffusion model for generation of tailored anti-cancer small molecules.一种用于生成定制抗癌小分子的基因型到药物扩散模型。
Nat Commun. 2025 Jul 1;16(1):5628. doi: 10.1038/s41467-025-60763-9.
3
Mol-AIR: Molecular Reinforcement Learning with Adaptive Intrinsic Rewards for Goal-Directed Molecular Generation.

本文引用的文献

1
Descriptor Free QSAR Modeling Using Deep Learning With Long Short-Term Memory Neural Networks.使用长短期记忆神经网络的深度学习进行无描述符定量构效关系建模
Front Artif Intell. 2019 Sep 6;2:17. doi: 10.3389/frai.2019.00017. eCollection 2019.
2
Are 2D fingerprints still valuable for drug discovery?二维指纹在药物发现中仍然有价值吗?
Phys Chem Chem Phys. 2020 Apr 29;22(16):8373-8390. doi: 10.1039/d0cp00305k.
3
Optimization of Molecules via Deep Reinforcement Learning.通过深度强化学习优化分子。
Mol-AIR:用于目标导向分子生成的具有自适应内在奖励的分子强化学习
J Chem Inf Model. 2025 Mar 10;65(5):2283-2296. doi: 10.1021/acs.jcim.4c01669. Epub 2025 Feb 24.
4
A systematic review of deep learning chemical language models in recent era.近期深度学习化学语言模型的系统综述。
J Cheminform. 2024 Nov 18;16(1):129. doi: 10.1186/s13321-024-00916-y.
5
Hamiltonian diversity: effectively measuring molecular diversity by shortest Hamiltonian circuits.哈密顿多样性:通过最短哈密顿回路有效测量分子多样性。
J Cheminform. 2024 Aug 7;16(1):94. doi: 10.1186/s13321-024-00883-4.
6
Diverse Hits in De Novo Molecule Design: Diversity-Based Comparison of Goal-Directed Generators.从头分子设计中的多样化命中:基于多样性的目标导向生成器的比较。
J Chem Inf Model. 2024 Aug 12;64(15):5756-5761. doi: 10.1021/acs.jcim.4c00519. Epub 2024 Jul 19.
7
Streamlining Computational Fragment-Based Drug Discovery through Evolutionary Optimization Informed by Ligand-Based Virtual Prescreening.通过基于配体的虚拟预筛选信息指导的进化优化,简化基于片段的药物发现的计算流程。
J Chem Inf Model. 2024 May 13;64(9):3826-3840. doi: 10.1021/acs.jcim.4c00234. Epub 2024 May 2.
8
TransGEM: a molecule generation model based on Transformer with gene expression data.TransGEM:基于基因表达数据的 Transformer 分子生成模型。
Bioinformatics. 2024 May 2;40(5). doi: 10.1093/bioinformatics/btae189.
9
Cheminformatics and artificial intelligence for accelerating agrochemical discovery.用于加速农用化学品发现的化学信息学与人工智能
Front Chem. 2023 Nov 29;11:1292027. doi: 10.3389/fchem.2023.1292027. eCollection 2023.
10
Evolutionary Monte Carlo of QM Properties in Chemical Space: Electrolyte Design.化学空间中量子力学性质的进化蒙特卡罗方法:电解质设计
J Chem Theory Comput. 2023 Dec 12;19(23):8861-8870. doi: 10.1021/acs.jctc.3c00822. Epub 2023 Nov 27.
Sci Rep. 2019 Jul 24;9(1):10752. doi: 10.1038/s41598-019-47148-x.
4
Deep Reinforcement Learning for Multiparameter Optimization in Drug Design.深度强化学习在药物设计中的多参数优化。
J Chem Inf Model. 2019 Jul 22;59(7):3166-3176. doi: 10.1021/acs.jcim.9b00325. Epub 2019 Jul 5.
5
Therapeutic Potential of Kappa Opioid Agonists.κ阿片受体激动剂的治疗潜力
Pharmaceuticals (Basel). 2019 Jun 20;12(2):95. doi: 10.3390/ph12020095.
6
An exploration strategy improves the diversity of de novo ligands using deep reinforcement learning: a case for the adenosine A receptor.一种探索策略利用深度强化学习提高从头设计配体的多样性:以腺苷 A 受体为例。
J Cheminform. 2019 May 24;11(1):35. doi: 10.1186/s13321-019-0355-6.
7
ChEMBL: towards direct deposition of bioassay data.ChEMBL:致力于直接生成生物测定数据。
Nucleic Acids Res. 2019 Jan 8;47(D1):D930-D940. doi: 10.1093/nar/gky1075.
8
Artificial Intelligence in Drug Design.人工智能在药物设计中的应用。
Molecules. 2018 Oct 2;23(10):2520. doi: 10.3390/molecules23102520.
9
Recent applications of machine learning in medicinal chemistry.机器学习在药物化学中的最新应用。
Bioorg Med Chem Lett. 2018 Sep 15;28(17):2807-2815. doi: 10.1016/j.bmcl.2018.06.046. Epub 2018 Jun 28.
10
Fréchet ChemNet Distance: A Metric for Generative Models for Molecules in Drug Discovery.Fréchet ChemNet 距离:药物发现中分子生成模型的一种度量。
J Chem Inf Model. 2018 Sep 24;58(9):1736-1741. doi: 10.1021/acs.jcim.8b00234. Epub 2018 Aug 28.