• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Magicmol:一个轻量级的药物分子进化和快速化学空间探索的流水线。

Magicmol: a light-weighted pipeline for drug-like molecule evolution and quick chemical space exploration.

机构信息

Yangtze Delta Region (Huzhou) Institute of Intelligent Transportation, Huzhou University, Huzhou, China.

Zhejiang Province Key Laboratory of Smart Management and Application of Modern Agricultural Resources, School of Information Engineering, Huzhou University, Huzhou, China.

出版信息

BMC Bioinformatics. 2023 Apr 26;24(1):173. doi: 10.1186/s12859-023-05286-0.

DOI:10.1186/s12859-023-05286-0
PMID:37101113
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10132416/
Abstract

The flourishment of machine learning and deep learning methods has boosted the development of cheminformatics, especially regarding the application of drug discovery and new material exploration. Lower time and space expenses make it possible for scientists to search the enormous chemical space. Recently, some work combined reinforcement learning strategies with recurrent neural network (RNN)-based models to optimize the property of generated small molecules, which notably improved a batch of critical factors for these candidates. However, a common problem among these RNN-based methods is that several generated molecules have difficulty in synthesizing despite owning higher desired properties such as binding affinity. However, RNN-based framework better reproduces the molecule distribution among the training set than other categories of models during molecule exploration tasks. Thus, to optimize the whole exploration process and make it contribute to the optimization of specified molecules, we devised a light-weighted pipeline called Magicmol; this pipeline has a re-mastered RNN network and utilize SELFIES presentation instead of SMILES. Our backbone model achieved extraordinary performance while reducing the training cost; moreover, we devised reward truncate strategies to eliminate the model collapse problem. Additionally, adopting SELFIES presentation made it possible to combine STONED-SELFIES as a post-processing procedure for specified molecule optimization and quick chemical space exploration.

摘要

机器学习和深度学习方法的蓬勃发展推动了化学信息学的发展,特别是在药物发现和新材料探索方面的应用。较低的时间和空间成本使得科学家有可能搜索巨大的化学空间。最近,一些工作将强化学习策略与基于递归神经网络(RNN)的模型相结合,以优化生成小分子的性质,这显著提高了这些候选物的一批关键因素。然而,这些基于 RNN 的方法的一个共同问题是,尽管生成的分子具有较高的理想性质,如结合亲和力,但仍有一些难以合成。然而,在分子探索任务中,基于 RNN 的框架比其他类别的模型更能在训练集中再现分子分布。因此,为了优化整个探索过程,并使其有助于优化指定的分子,我们设计了一个名为 Magicmol 的轻量级管道;该管道具有重新掌握的 RNN 网络,并使用 SELFIES 表示而不是 SMILES。我们的骨干模型在降低训练成本的同时实现了卓越的性能;此外,我们设计了奖励截断策略来消除模型崩溃问题。此外,采用 SELFIES 表示使得可以将 STONED-SELFIES 作为指定分子优化和快速化学空间探索的后处理过程。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c476/10134645/e1e06ab11d62/12859_2023_5286_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c476/10134645/8b08c8000d45/12859_2023_5286_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c476/10134645/f8d82d6a3315/12859_2023_5286_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c476/10134645/36aa5122875b/12859_2023_5286_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c476/10134645/671b0eeb07c8/12859_2023_5286_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c476/10134645/93b33e47eb13/12859_2023_5286_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c476/10134645/825017252868/12859_2023_5286_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c476/10134645/acc5a544d6fc/12859_2023_5286_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c476/10134645/8cb23d55d264/12859_2023_5286_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c476/10134645/ea54844cb62f/12859_2023_5286_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c476/10134645/5b4ad2ff2d93/12859_2023_5286_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c476/10134645/50099efe1ce3/12859_2023_5286_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c476/10134645/e1e06ab11d62/12859_2023_5286_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c476/10134645/8b08c8000d45/12859_2023_5286_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c476/10134645/f8d82d6a3315/12859_2023_5286_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c476/10134645/36aa5122875b/12859_2023_5286_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c476/10134645/671b0eeb07c8/12859_2023_5286_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c476/10134645/93b33e47eb13/12859_2023_5286_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c476/10134645/825017252868/12859_2023_5286_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c476/10134645/acc5a544d6fc/12859_2023_5286_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c476/10134645/8cb23d55d264/12859_2023_5286_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c476/10134645/ea54844cb62f/12859_2023_5286_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c476/10134645/5b4ad2ff2d93/12859_2023_5286_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c476/10134645/50099efe1ce3/12859_2023_5286_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c476/10134645/e1e06ab11d62/12859_2023_5286_Fig12_HTML.jpg

相似文献

1
Magicmol: a light-weighted pipeline for drug-like molecule evolution and quick chemical space exploration.Magicmol:一个轻量级的药物分子进化和快速化学空间探索的流水线。
BMC Bioinformatics. 2023 Apr 26;24(1):173. doi: 10.1186/s12859-023-05286-0.
2
De novo drug design based on Stack-RNN with multi-objective reward-weighted sum and reinforcement learning.基于堆叠循环神经网络的多目标奖励加权和强化学习的从头药物设计。
J Mol Model. 2023 Mar 30;29(4):121. doi: 10.1007/s00894-023-05523-6.
3
Deep inverse reinforcement learning for structural evolution of small molecules.小分子结构演化的深度逆向强化学习。
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa364.
4
DrugEx v2: de novo design of drug molecules by Pareto-based multi-objective reinforcement learning in polypharmacology.DrugEx v2:基于帕累托的多目标强化学习在多药理学中从头设计药物分子
J Cheminform. 2021 Nov 12;13(1):85. doi: 10.1186/s13321-021-00561-9.
5
Design of Molecules with Low Hole Reorganization Energy Based on a Quarter-Million Molecule DFT Screen: Part 2.基于二十五万分子密度泛函理论筛选的低空穴重组能分子设计:第二部分
J Phys Chem A. 2022 Sep 1;126(34):5837-5852. doi: 10.1021/acs.jpca.2c04221. Epub 2022 Aug 19.
6
The power of deep learning to ligand-based novel drug discovery.深度学习在基于配体的新药发现中的作用。
Expert Opin Drug Discov. 2020 Jul;15(7):755-764. doi: 10.1080/17460441.2020.1745183. Epub 2020 Mar 31.
7
Training recurrent neural networks as generative neural networks for molecular structures: how does it impact drug discovery?将循环神经网络训练为生成式神经网络用于分子结构:它如何影响药物发现?
Expert Opin Drug Discov. 2022 Oct;17(10):1071-1079. doi: 10.1080/17460441.2023.2134340. Epub 2022 Oct 17.
8
An exploration strategy improves the diversity of de novo ligands using deep reinforcement learning: a case for the adenosine A receptor.一种探索策略利用深度强化学习提高从头设计配体的多样性:以腺苷 A 受体为例。
J Cheminform. 2019 May 24;11(1):35. doi: 10.1186/s13321-019-0355-6.
9
De novo drug design as GPT language modeling: large chemistry models with supervised and reinforcement learning.从头开始的药物设计与 GPT 语言模型:具有监督和强化学习的大型化学模型。
J Comput Aided Mol Des. 2024 Apr 22;38(1):20. doi: 10.1007/s10822-024-00559-z.
10
Faster and more diverse de novo molecular optimization with double-loop reinforcement learning using augmented SMILES.使用增强型 SMILES 进行双环强化学习,实现更快、更多样的从头分子优化。
J Comput Aided Mol Des. 2023 Aug;37(8):373-394. doi: 10.1007/s10822-023-00512-6. Epub 2023 Jun 17.

本文引用的文献

1
MolSearch: Search-based Multi-objective Molecular Generation and Property Optimization.MolSearch:基于搜索的多目标分子生成与性质优化
KDD. 2022 Aug;2022:4724-4732. doi: 10.1145/3534678.3542676. Epub 2022 Aug 14.
2
SELFIES and the future of molecular string representations.自拍与分子串表示法的未来。
Patterns (N Y). 2022 Oct 14;3(10):100588. doi: 10.1016/j.patter.2022.100588.
3
Multi-Objective Drug Design Based on Graph-Fragment Molecular Representation and Deep Evolutionary Learning.基于图片段分子表示和深度进化学习的多目标药物设计
Front Pharmacol. 2022 Jul 4;13:920747. doi: 10.3389/fphar.2022.920747. eCollection 2022.
4
Language models can learn complex molecular distributions.语言模型可以学习复杂的分子分布。
Nat Commun. 2022 Jun 7;13(1):3293. doi: 10.1038/s41467-022-30839-x.
5
MGCVAE: Multi-Objective Inverse Design via Molecular Graph Conditional Variational Autoencoder.MGCVAE:基于分子图条件变分自动编码器的多目标反设计。
J Chem Inf Model. 2022 Jun 27;62(12):2943-2950. doi: 10.1021/acs.jcim.2c00487. Epub 2022 Jun 6.
6
Using deep neural networks to explore chemical space.使用深度神经网络探索化学空间。
Expert Opin Drug Discov. 2022 Mar;17(3):297-304. doi: 10.1080/17460441.2022.2019704. Epub 2021 Dec 29.
7
MoleGuLAR: Molecule Generation Using Reinforcement Learning with Alternating Rewards.MoleGuLAR:使用交替奖励的强化学习进行分子生成。
J Chem Inf Model. 2021 Dec 27;61(12):5815-5826. doi: 10.1021/acs.jcim.1c01341. Epub 2021 Dec 6.
8
Generative Models for De Novo Drug Design.用于从头药物设计的生成模型。
J Med Chem. 2021 Oct 14;64(19):14011-14027. doi: 10.1021/acs.jmedchem.1c00927. Epub 2021 Sep 17.
9
Beyond generative models: superfast traversal, optimization, novelty, exploration and discovery (STONED) algorithm for molecules using SELFIES.超越生成模型:使用SELFIES的分子超快速遍历、优化、新颖性、探索与发现(STONED)算法
Chem Sci. 2021 Apr 20;12(20):7079-7090. doi: 10.1039/d1sc00231g.
10
Graph neural networks for automated de novo drug design.图神经网络在药物从头设计中的应用。
Drug Discov Today. 2021 Jun;26(6):1382-1393. doi: 10.1016/j.drudis.2021.02.011. Epub 2021 Feb 17.