• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用生成式预训练变换器和深度强化学习在化学空间中优化结合亲和力

Optimization of binding affinities in chemical space with generative pre-trained transformer and deep reinforcement learning.

作者信息

Xu Xiaopeng, Zhou Juexiao, Zhu Chen, Zhan Qing, Li Zhongxiao, Zhang Ruochi, Wang Yu, Liao Xingyu, Gao Xin

机构信息

Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia.

Computer, Electrical and Mathematical Sciences and Engineering (CEMSE), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia.

出版信息

F1000Res. 2024 Feb 20;12:757. doi: 10.12688/f1000research.130936.2. eCollection 2023.

DOI:10.12688/f1000research.130936.2
PMID:38434657
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10905145/
Abstract

BACKGROUND

The key challenge in drug discovery is to discover novel compounds with desirable properties. Among the properties, binding affinity to a target is one of the prerequisites and usually evaluated by molecular docking or quantitative structure activity relationship (QSAR) models.

METHODS

In this study, we developed SGPT-RL, which uses a generative pre-trained transformer (GPT) as the policy network of the reinforcement learning (RL) agent to optimize the binding affinity to a target. SGPT-RL was evaluated on the Moses distribution learning benchmark and two goal-directed generation tasks, with Dopamine Receptor D2 (DRD2) and Angiotensin-Converting Enzyme 2 (ACE2) as the targets. Both QSAR model and molecular docking were implemented as the optimization goals in the tasks. The popular Reinvent method was used as the baseline for comparison.

RESULTS

The results on the Moses benchmark showed that SGPT-RL learned good property distributions and generated molecules with high validity and novelty. On the two goal-directed generation tasks, both SGPT-RL and Reinvent were able to generate valid molecules with improved target scores. The SGPT-RL method achieved better results than Reinvent on the ACE2 task, where molecular docking was used as the optimization goal. Further analysis shows that SGPT-RL learned conserved scaffold patterns during exploration.

CONCLUSIONS

The superior performance of SGPT-RL in the ACE2 task indicates that it can be applied to the virtual screening process where molecular docking is widely used as the criteria. Besides, the scaffold patterns learned by SGPT-RL during the exploration process can assist chemists to better design and discover novel lead candidates.

摘要

背景

药物研发中的关键挑战是发现具有理想特性的新型化合物。在这些特性中,与靶点的结合亲和力是先决条件之一,通常通过分子对接或定量构效关系(QSAR)模型进行评估。

方法

在本研究中,我们开发了SGPT-RL,它使用生成式预训练变换器(GPT)作为强化学习(RL)智能体的策略网络,以优化与靶点的结合亲和力。SGPT-RL在Moses分布学习基准以及两个目标导向生成任务上进行了评估,以多巴胺受体D2(DRD2)和血管紧张素转换酶2(ACE2)作为靶点。QSAR模型和分子对接均被用作任务中的优化目标。采用流行的Reinvent方法作为比较基线。

结果

在Moses基准上的结果表明,SGPT-RL学习到了良好的特性分布,并生成了具有高有效性和新颖性的分子。在两个目标导向生成任务中,SGPT-RL和Reinvent都能够生成具有更高靶点得分的有效分子。在以分子对接作为优化目标的ACE2任务中,SGPT-RL方法比Reinvent取得了更好的结果。进一步分析表明,SGPT-RL在探索过程中学习到了保守的骨架模式。

结论

SGPT-RL在ACE2任务中的卓越性能表明,它可应用于广泛使用分子对接作为标准的虚拟筛选过程。此外,SGPT-RL在探索过程中学习到的骨架模式可以帮助化学家更好地设计和发现新型先导候选物。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5b7b/10905186/3f2eb872ed50/f1000research-12-162639-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5b7b/10905186/c84295779bf6/f1000research-12-162639-g0000.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5b7b/10905186/a5118dcd26a6/f1000research-12-162639-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5b7b/10905186/5414b8fcc75f/f1000research-12-162639-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5b7b/10905186/114f410209b8/f1000research-12-162639-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5b7b/10905186/0019375c82a3/f1000research-12-162639-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5b7b/10905186/3f2eb872ed50/f1000research-12-162639-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5b7b/10905186/c84295779bf6/f1000research-12-162639-g0000.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5b7b/10905186/a5118dcd26a6/f1000research-12-162639-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5b7b/10905186/5414b8fcc75f/f1000research-12-162639-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5b7b/10905186/114f410209b8/f1000research-12-162639-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5b7b/10905186/0019375c82a3/f1000research-12-162639-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5b7b/10905186/3f2eb872ed50/f1000research-12-162639-g0005.jpg

相似文献

1
Optimization of binding affinities in chemical space with generative pre-trained transformer and deep reinforcement learning.利用生成式预训练变换器和深度强化学习在化学空间中优化结合亲和力
F1000Res. 2024 Feb 20;12:757. doi: 10.12688/f1000research.130936.2. eCollection 2023.
2
Comparison of structure- and ligand-based scoring functions for deep generative models: a GPCR case study.深度生成模型中基于结构和配体的评分函数比较:以G蛋白偶联受体为例的研究
J Cheminform. 2021 May 13;13(1):39. doi: 10.1186/s13321-021-00516-0.
3
Evaluation of reinforcement learning in transformer-based molecular design.基于Transformer的分子设计中强化学习的评估
J Cheminform. 2024 Aug 8;16(1):95. doi: 10.1186/s13321-024-00887-0.
4
AB-Gen: Antibody Library Design with Generative Pre-trained Transformer and Deep Reinforcement Learning.AB-Gen:基于生成式预训练变换器和深度强化学习的抗体库设计
Genomics Proteomics Bioinformatics. 2023 Oct;21(5):1043-1053. doi: 10.1016/j.gpb.2023.03.004. Epub 2023 Jun 24.
5
De Novo Drug Design Using Transformer-Based Machine Translation and Reinforcement Learning of an Adaptive Monte Carlo Tree Search.基于Transformer的机器翻译和自适应蒙特卡罗树搜索强化学习的从头药物设计
Pharmaceuticals (Basel). 2024 Jan 27;17(2):161. doi: 10.3390/ph17020161.
6
Molecule generation toward target protein (SARS-CoV-2) using reinforcement learning-based graph neural network via knowledge graph.通过知识图谱,利用基于强化学习的图神经网络生成针对目标蛋白(SARS-CoV-2)的分子。
Netw Model Anal Health Inform Bioinform. 2023;12(1):13. doi: 10.1007/s13721-023-00409-2. Epub 2023 Jan 6.
7
GRELinker: A Graph-Based Generative Model for Molecular Linker Design with Reinforcement and Curriculum Learning.GRELinker:一种基于图的生成模型,用于通过强化学习和课程学习进行分子连接体设计。
J Chem Inf Model. 2024 Feb 12;64(3):666-676. doi: 10.1021/acs.jcim.3c01700. Epub 2024 Jan 19.
8
MTMol-GPT: De novo multi-target molecular generation with transformer-based generative adversarial imitation learning.MTMol-GPT:基于生成式对抗模仿学习的新型多靶点分子生成
PLoS Comput Biol. 2024 Jun 26;20(6):e1012229. doi: 10.1371/journal.pcbi.1012229. eCollection 2024 Jun.
9
FSM-DDTR: End-to-end feedback strategy for multi-objective De Novo drug design using transformers.FSM-DDTR:使用变压器的多目标从头药物设计的端到端反馈策略。
Comput Biol Med. 2023 Sep;164:107285. doi: 10.1016/j.compbiomed.2023.107285. Epub 2023 Jul 31.
10
DockingGA: enhancing targeted molecule generation using transformer neural network and genetic algorithm with docking simulation.对接遗传算法:利用变压器神经网络和遗传算法结合对接模拟增强靶向分子生成
Brief Funct Genomics. 2024 Sep 27;23(5):595-606. doi: 10.1093/bfgp/elae011.

引用本文的文献

1
HELM-GPT: de novo macrocyclic peptide design using generative pre-trained transformer.HELM-GPT:使用生成式预训练转换器进行从头大环肽设计。
Bioinformatics. 2024 Jun 3;40(6). doi: 10.1093/bioinformatics/btae364.

本文引用的文献

1
Impact of computational approaches in the fight against COVID-19: an AI guided review of 17 000 studies.计算方法在抗击 COVID-19 中的作用:人工智能指导的 17000 项研究综述。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab456.
2
MolGPT: Molecular Generation Using a Transformer-Decoder Model.MolGPT:基于 Transformer-Decoder 模型的分子生成。
J Chem Inf Model. 2022 May 9;62(9):2064-2076. doi: 10.1021/acs.jcim.1c00600. Epub 2021 Oct 25.
3
Molecular optimization by capturing chemist's intuition using deep neural networks.
通过使用深度神经网络捕捉化学家的直觉进行分子优化。
J Cheminform. 2021 Mar 20;13(1):26. doi: 10.1186/s13321-021-00497-0.
4
Transformer neural network for protein-specific de novo drug generation as a machine translation problem.基于机器翻译问题的用于蛋白质特异性从头药物生成的 Transformer 神经网络。
Sci Rep. 2021 Jan 11;11(1):321. doi: 10.1038/s41598-020-79682-4.
5
Molecular Sets (MOSES): A Benchmarking Platform for Molecular Generation Models.分子集(MOSES):分子生成模型的基准测试平台。
Front Pharmacol. 2020 Dec 18;11:565644. doi: 10.3389/fphar.2020.565644. eCollection 2020.
6
Graph-based generative models for de Novo drug design.基于图的从头药物设计生成模型。
Drug Discov Today Technol. 2019 Dec;32-33:45-53. doi: 10.1016/j.ddtec.2020.11.004. Epub 2020 Nov 21.
7
REINVENT 2.0: An AI Tool for De Novo Drug Design.REINVENT 2.0:一种用于从头设计药物的人工智能工具。
J Chem Inf Model. 2020 Dec 28;60(12):5918-5922. doi: 10.1021/acs.jcim.0c00915. Epub 2020 Oct 29.
8
OptiMol: Optimization of Binding Affinities in Chemical Space for Drug Discovery.OptiMol:化学空间中用于药物发现的结合亲和力优化。
J Chem Inf Model. 2020 Dec 28;60(12):5658-5666. doi: 10.1021/acs.jcim.0c00833. Epub 2020 Oct 7.
9
The Advent of Generative Chemistry.生成化学的出现。
ACS Med Chem Lett. 2020 Jul 14;11(8):1496-1505. doi: 10.1021/acsmedchemlett.0c00088. eCollection 2020 Aug 13.
10
A pneumonia outbreak associated with a new coronavirus of probable bat origin.一种新型冠状病毒引发的肺炎疫情,该病毒可能来源于蝙蝠。
Nature. 2020 Mar;579(7798):270-273. doi: 10.1038/s41586-020-2012-7. Epub 2020 Feb 3.