用于靶向分子生成的面向多样性的深度强化学习

Diversity oriented Deep Reinforcement Learning for targeted molecule generation.

作者信息

Pereira Tiago, Abbasi Maryam, Ribeiro Bernardete, Arrais Joel P

机构信息

Department of Informatics Engineering, Centre for Informatics and Systems of the University of Coimbra, University of Coimbra, Pinhal de Marrocos, Coimbra, Portugal.

出版信息

J Cheminform. 2021 Mar 9;13(1):21. doi: 10.1186/s13321-021-00498-z.

DOI:10.1186/s13321-021-00498-z

PMID:33750461

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7944916/

Abstract

In this work, we explore the potential of deep learning to streamline the process of identifying new potential drugs through the computational generation of molecules with interesting biological properties. Two deep neural networks compose our targeted generation framework: the Generator, which is trained to learn the building rules of valid molecules employing SMILES strings notation, and the Predictor which evaluates the newly generated compounds by predicting their affinity for the desired target. Then, the Generator is optimized through Reinforcement Learning to produce molecules with bespoken properties. The innovation of this approach is the exploratory strategy applied during the reinforcement training process that seeks to add novelty to the generated compounds. This training strategy employs two Generators interchangeably to sample new SMILES: the initially trained model that will remain fixed and a copy of the previous one that will be updated during the training to uncover the most promising molecules. The evolution of the reward assigned by the Predictor determines how often each one is employed to select the next token of the molecule. This strategy establishes a compromise between the need to acquire more information about the chemical space and the need to sample new molecules, with the experience gained so far. To demonstrate the effectiveness of the method, the Generator is trained to design molecules with an optimized coefficient of partition and also high inhibitory power against the Adenosine [Formula: see text] and [Formula: see text] opioid receptors. The results reveal that the model can effectively adjust the newly generated molecules towards the wanted direction. More importantly, it was possible to find promising sets of unique and diverse molecules, which was the main purpose of the newly implemented strategy.

摘要

在这项工作中，我们探索了深度学习的潜力，通过计算生成具有有趣生物学特性的分子，来简化识别新潜在药物的过程。我们的目标生成框架由两个深度神经网络组成：生成器，它经过训练以学习使用SMILES字符串表示法的有效分子的构建规则；预测器，它通过预测新生成化合物对所需靶点的亲和力来评估这些化合物。然后，通过强化学习对生成器进行优化，以生成具有特定性质的分子。这种方法的创新之处在于在强化训练过程中应用的探索策略，该策略旨在为生成的化合物增添新颖性。这种训练策略交替使用两个生成器来采样新的SMILES：初始训练的模型将保持不变，以及前一个模型的副本，该副本将在训练过程中更新，以发现最有前景的分子。预测器分配的奖励的演变决定了每个生成器用于选择分子的下一个令牌的频率。这种策略在获取更多关于化学空间信息的需求与采样新分子的需求之间，以及与迄今为止获得的经验之间达成了妥协。为了证明该方法的有效性，对生成器进行训练，以设计具有优化分配系数且对腺苷[公式：见正文]和[公式：见正文]阿片受体具有高抑制能力的分子。结果表明，该模型可以有效地将新生成的分子朝着期望的方向调整。更重要的是，有可能找到一组有前景的独特且多样的分子，这是新实施策略的主要目的。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4aee/7944916/c1af330193a1/13321_2021_498_Fig1_HTML.jpg

相似文献

Diversity oriented Deep Reinforcement Learning for targeted molecule generation.

J Cheminform. 2021 Mar 9;13(1):21. doi: 10.1186/s13321-021-00498-z.

DrugEx v2: de novo design of drug molecules by Pareto-based multi-objective reinforcement learning in polypharmacology.

J Cheminform. 2021 Nov 12;13(1):85. doi: 10.1186/s13321-021-00561-9.

Optimizing blood-brain barrier permeation through deep reinforcement learning for de novo drug design.

Bioinformatics. 2021 Jul 12;37(Suppl_1):i84-i92. doi: 10.1093/bioinformatics/btab301.

FSM-DDTR: End-to-end feedback strategy for multi-objective De Novo drug design using transformers.

Comput Biol Med. 2023 Sep;164:107285. doi: 10.1016/j.compbiomed.2023.107285. Epub 2023 Jul 31.

An exploration strategy improves the diversity of de novo ligands using deep reinforcement learning: a case for the adenosine A receptor.

J Cheminform. 2019 May 24;11(1):35. doi: 10.1186/s13321-019-0355-6.

Deep reinforcement learning for de novo drug design.

Sci Adv. 2018 Jul 25;4(7):eaap7885. doi: 10.1126/sciadv.aap7885. eCollection 2018 Jul.

Designing optimized drug candidates with Generative Adversarial Network.

J Cheminform. 2022 Jun 26;14(1):40. doi: 10.1186/s13321-022-00623-6.

Faster and more diverse de novo molecular optimization with double-loop reinforcement learning using augmented SMILES.

J Comput Aided Mol Des. 2023 Aug;37(8):373-394. doi: 10.1007/s10822-023-00512-6. Epub 2023 Jun 17.

UnCorrupt SMILES: a novel approach to de novo design.

J Cheminform. 2023 Feb 14;15(1):22. doi: 10.1186/s13321-023-00696-x.

Molecule generation using transformers and policy gradient reinforcement learning.

Sci Rep. 2023 May 31;13(1):8799. doi: 10.1038/s41598-023-35648-w.

引用本文的文献

Optimizing blood-brain barrier permeability in KRAS inhibitors: A structure-constrained molecular generation approach.

J Pharm Anal. 2025 Aug;15(8):101337. doi: 10.1016/j.jpha.2025.101337. Epub 2025 May 9.

A genotype-to-drug diffusion model for generation of tailored anti-cancer small molecules.

Nat Commun. 2025 Jul 1;16(1):5628. doi: 10.1038/s41467-025-60763-9.

Mol-AIR: Molecular Reinforcement Learning with Adaptive Intrinsic Rewards for Goal-Directed Molecular Generation.

J Chem Inf Model. 2025 Mar 10;65(5):2283-2296. doi: 10.1021/acs.jcim.4c01669. Epub 2025 Feb 24.

A systematic review of deep learning chemical language models in recent era.

J Cheminform. 2024 Nov 18;16(1):129. doi: 10.1186/s13321-024-00916-y.

Hamiltonian diversity: effectively measuring molecular diversity by shortest Hamiltonian circuits.

J Cheminform. 2024 Aug 7;16(1):94. doi: 10.1186/s13321-024-00883-4.

Diverse Hits in De Novo Molecule Design: Diversity-Based Comparison of Goal-Directed Generators.

J Chem Inf Model. 2024 Aug 12;64(15):5756-5761. doi: 10.1021/acs.jcim.4c00519. Epub 2024 Jul 19.

Streamlining Computational Fragment-Based Drug Discovery through Evolutionary Optimization Informed by Ligand-Based Virtual Prescreening.

J Chem Inf Model. 2024 May 13;64(9):3826-3840. doi: 10.1021/acs.jcim.4c00234. Epub 2024 May 2.

TransGEM: a molecule generation model based on Transformer with gene expression data.

Bioinformatics. 2024 May 2;40(5). doi: 10.1093/bioinformatics/btae189.

Cheminformatics and artificial intelligence for accelerating agrochemical discovery.

Front Chem. 2023 Nov 29;11:1292027. doi: 10.3389/fchem.2023.1292027. eCollection 2023.

Evolutionary Monte Carlo of QM Properties in Chemical Space: Electrolyte Design.

J Chem Theory Comput. 2023 Dec 12;19(23):8861-8870. doi: 10.1021/acs.jctc.3c00822. Epub 2023 Nov 27.

本文引用的文献

Descriptor Free QSAR Modeling Using Deep Learning With Long Short-Term Memory Neural Networks.

Front Artif Intell. 2019 Sep 6;2:17. doi: 10.3389/frai.2019.00017. eCollection 2019.

Are 2D fingerprints still valuable for drug discovery?

Phys Chem Chem Phys. 2020 Apr 29;22(16):8373-8390. doi: 10.1039/d0cp00305k.

Optimization of Molecules via Deep Reinforcement Learning.

Sci Rep. 2019 Jul 24;9(1):10752. doi: 10.1038/s41598-019-47148-x.

Deep Reinforcement Learning for Multiparameter Optimization in Drug Design.

J Chem Inf Model. 2019 Jul 22;59(7):3166-3176. doi: 10.1021/acs.jcim.9b00325. Epub 2019 Jul 5.

Therapeutic Potential of Kappa Opioid Agonists.

Pharmaceuticals (Basel). 2019 Jun 20;12(2):95. doi: 10.3390/ph12020095.

An exploration strategy improves the diversity of de novo ligands using deep reinforcement learning: a case for the adenosine A receptor.

J Cheminform. 2019 May 24;11(1):35. doi: 10.1186/s13321-019-0355-6.

ChEMBL: towards direct deposition of bioassay data.

Nucleic Acids Res. 2019 Jan 8;47(D1):D930-D940. doi: 10.1093/nar/gky1075.

Artificial Intelligence in Drug Design.

Molecules. 2018 Oct 2;23(10):2520. doi: 10.3390/molecules23102520.

Recent applications of machine learning in medicinal chemistry.

Bioorg Med Chem Lett. 2018 Sep 15;28(17):2807-2815. doi: 10.1016/j.bmcl.2018.06.046. Epub 2018 Jun 28.

Fréchet ChemNet Distance: A Metric for Generative Models for Molecules in Drug Discovery.

J Chem Inf Model. 2018 Sep 24;58(9):1736-1741. doi: 10.1021/acs.jcim.8b00234. Epub 2018 Aug 28.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

用于靶向分子生成的面向多样性的深度强化学习

Diversity oriented Deep Reinforcement Learning for targeted molecule generation.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献