• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

cMolGPT:一种用于靶向特定从头分子生成的条件生成式预训练转换器。

cMolGPT: A Conditional Generative Pre-Trained Transformer for Target-Specific De Novo Molecular Generation.

机构信息

Biotherapeutic and Medicinal Sciences, Biogen, 225 Binney Street, Cambridge, MA 02142, USA.

College of Agriculture and Life Sciences, Cornell University, Ithaca, NY 14850, USA.

出版信息

Molecules. 2023 May 30;28(11):4430. doi: 10.3390/molecules28114430.

DOI:10.3390/molecules28114430
PMID:37298906
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10254772/
Abstract

Deep generative models applied to the generation of novel compounds in small-molecule drug design have attracted a lot of attention in recent years. To design compounds that interact with specific target proteins, we propose a Generative Pre-Trained Transformer (GPT)-inspired model for de novo target-specific molecular design. By implementing different keys and values for the multi-head attention conditional on a specified target, the proposed method can generate drug-like compounds both with and without a specific target. The results show that our approach (cMolGPT) is capable of generating SMILES strings that correspond to both drug-like and active compounds. Moreover, the compounds generated from the conditional model closely match the chemical space of real target-specific molecules and cover a significant portion of novel compounds. Thus, the proposed Conditional Generative Pre-Trained Transformer (cMolGPT) is a valuable tool for de novo molecule design and has the potential to accelerate the molecular optimization cycle time.

摘要

近年来,应用于小分子药物设计中新型化合物生成的深度生成模型受到了广泛关注。为了设计与特定靶标蛋白相互作用的化合物,我们提出了一种基于生成式预训练转换器(GPT)的新方法,用于从头开始针对特定靶标的分子设计。通过在指定的靶标上为多头注意力实现不同的键和值,该方法可以生成既有又没有特定靶标的类药物化合物。结果表明,我们的方法(cMolGPT)能够生成与类药物和活性化合物相对应的 SMILES 字符串。此外,从条件模型生成的化合物与真实靶标特异性分子的化学空间紧密匹配,并涵盖了很大一部分新型化合物。因此,所提出的条件生成式预训练转换器(cMolGPT)是一种有价值的从头分子设计工具,有可能加速分子优化周期时间。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2304/10254772/247eb0030f00/molecules-28-04430-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2304/10254772/64b2c1385a34/molecules-28-04430-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2304/10254772/c2c171ca4b6c/molecules-28-04430-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2304/10254772/62321f33f14a/molecules-28-04430-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2304/10254772/0c7e7c18dd1c/molecules-28-04430-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2304/10254772/247eb0030f00/molecules-28-04430-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2304/10254772/64b2c1385a34/molecules-28-04430-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2304/10254772/c2c171ca4b6c/molecules-28-04430-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2304/10254772/62321f33f14a/molecules-28-04430-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2304/10254772/0c7e7c18dd1c/molecules-28-04430-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2304/10254772/247eb0030f00/molecules-28-04430-g005.jpg

相似文献

1
cMolGPT: A Conditional Generative Pre-Trained Transformer for Target-Specific De Novo Molecular Generation.cMolGPT:一种用于靶向特定从头分子生成的条件生成式预训练转换器。
Molecules. 2023 May 30;28(11):4430. doi: 10.3390/molecules28114430.
2
Adapt-cMolGPT: A Conditional Generative Pre-Trained Transformer with Adapter-Based Fine-Tuning for Target-Specific Molecular Generation.自适应-cMolGPT:基于适配器的条件生成式预训练转换器,用于特定目标的分子生成微调。
Int J Mol Sci. 2024 Jun 17;25(12):6641. doi: 10.3390/ijms25126641.
3
Generative Pre-trained Transformer (GPT) based model with relative attention for de novo drug design.基于生成式预训练转换器(GPT)的相对注意力模型在从头设计药物中的应用。
Comput Biol Chem. 2023 Oct;106:107911. doi: 10.1016/j.compbiolchem.2023.107911. Epub 2023 Jun 29.
4
MTMol-GPT: De novo multi-target molecular generation with transformer-based generative adversarial imitation learning.MTMol-GPT:基于生成式对抗模仿学习的新型多靶点分子生成
PLoS Comput Biol. 2024 Jun 26;20(6):e1012229. doi: 10.1371/journal.pcbi.1012229. eCollection 2024 Jun.
5
MolGPT: Molecular Generation Using a Transformer-Decoder Model.MolGPT:基于 Transformer-Decoder 模型的分子生成。
J Chem Inf Model. 2022 May 9;62(9):2064-2076. doi: 10.1021/acs.jcim.1c00600. Epub 2021 Oct 25.
6
Llamol: a dynamic multi-conditional generative transformer for de novo molecular design.Llamol:一种用于从头分子设计的动态多条件生成式变换器。
J Cheminform. 2024 Jun 21;16(1):73. doi: 10.1186/s13321-024-00863-8.
7
FSM-DDTR: End-to-end feedback strategy for multi-objective De Novo drug design using transformers.FSM-DDTR:使用变压器的多目标从头药物设计的端到端反馈策略。
Comput Biol Med. 2023 Sep;164:107285. doi: 10.1016/j.compbiomed.2023.107285. Epub 2023 Jul 31.
8
RM-GPT: Enhance the comprehensive generative ability of molecular GPT model via LocalRNN and RealFormer.RM-GPT:通过 LocalRNN 和 RealFormer 增强分子 GPT 模型的综合生成能力。
Artif Intell Med. 2024 Apr;150:102827. doi: 10.1016/j.artmed.2024.102827. Epub 2024 Feb 27.
9
Multi-objective de novo drug design with conditional graph generative model.基于条件图生成模型的多目标从头药物设计
J Cheminform. 2018 Jul 24;10(1):33. doi: 10.1186/s13321-018-0287-6.
10
UnCorrupt SMILES: a novel approach to de novo design.未腐败的SMILES:一种全新的从头设计方法。
J Cheminform. 2023 Feb 14;15(1):22. doi: 10.1186/s13321-023-00696-x.

引用本文的文献

1
A genotype-to-drug diffusion model for generation of tailored anti-cancer small molecules.一种用于生成定制抗癌小分子的基因型到药物扩散模型。
Nat Commun. 2025 Jul 1;16(1):5628. doi: 10.1038/s41467-025-60763-9.
2
A review of large language models and autonomous agents in chemistry.化学领域中大型语言模型与自主智能体的综述。
Chem Sci. 2024 Dec 9;16(6):2514-2572. doi: 10.1039/d4sc03921a. eCollection 2025 Feb 5.
3
Large language models facilitating modern molecular biology and novel drug development.大型语言模型助力现代分子生物学和新型药物研发。

本文引用的文献

1
New avenues in artificial-intelligence-assisted drug discovery.人工智能辅助药物发现的新途径。
Drug Discov Today. 2023 Apr;28(4):103516. doi: 10.1016/j.drudis.2023.103516. Epub 2023 Feb 2.
2
Generative Models for De Novo Drug Design.用于从头药物设计的生成模型。
J Med Chem. 2021 Oct 14;64(19):14011-14027. doi: 10.1021/acs.jmedchem.1c00927. Epub 2021 Sep 17.
3
Molecular optimization by capturing chemist's intuition using deep neural networks.通过使用深度神经网络捕捉化学家的直觉进行分子优化。
Front Pharmacol. 2024 Dec 24;15:1458739. doi: 10.3389/fphar.2024.1458739. eCollection 2024.
4
Machine Learning-Aided Inverse Design and Discovery of Novel Polymeric Materials for Membrane Separation.机器学习辅助的用于膜分离的新型高分子材料逆设计与发现
Environ Sci Technol. 2025 Jan 21;59(2):993-1012. doi: 10.1021/acs.est.4c08298. Epub 2024 Dec 16.
5
A systematic review of deep learning chemical language models in recent era.近期深度学习化学语言模型的系统综述。
J Cheminform. 2024 Nov 18;16(1):129. doi: 10.1186/s13321-024-00916-y.
6
Unlocking comprehensive molecular design across all scenarios with large language model and unordered chemical language.利用大语言模型和无序化学语言解锁所有场景下的全面分子设计。
Chem Sci. 2024 Jul 29;15(34):13727-13740. doi: 10.1039/d4sc03744h. eCollection 2024 Aug 28.
7
Chemical language modeling with structured state space sequence models.基于结构化状态空间序列模型的化学语言建模。
Nat Commun. 2024 Jul 22;15(1):6176. doi: 10.1038/s41467-024-50469-9.
8
Extension of multi-site analogue series with potent compounds using a bidirectional transformer-based chemical language model.使用基于双向变压器的化学语言模型对具有强效化合物的多部位类似物系列进行扩展。
RSC Med Chem. 2024 Jun 17;15(7):2527-2537. doi: 10.1039/d4md00423j. eCollection 2024 Jul 17.
9
Augmented Memory: Sample-Efficient Generative Molecular Design with Reinforcement Learning.增强记忆:基于强化学习的样本高效生成式分子设计
JACS Au. 2024 Apr 10;4(6):2160-2172. doi: 10.1021/jacsau.4c00066. eCollection 2024 Jun 24.
10
Adapt-cMolGPT: A Conditional Generative Pre-Trained Transformer with Adapter-Based Fine-Tuning for Target-Specific Molecular Generation.自适应-cMolGPT:基于适配器的条件生成式预训练转换器,用于特定目标的分子生成微调。
Int J Mol Sci. 2024 Jun 17;25(12):6641. doi: 10.3390/ijms25126641.
J Cheminform. 2021 Mar 20;13(1):26. doi: 10.1186/s13321-021-00497-0.
4
Visualization of very large high-dimensional data sets as minimum spanning trees.将超大型高维数据集可视化为最小生成树。
J Cheminform. 2020 Feb 12;12(1):12. doi: 10.1186/s13321-020-0416-x.
5
A de novo molecular generation method using latent vector based generative adversarial network.一种使用基于潜在向量的生成对抗网络的从头分子生成方法。
J Cheminform. 2019 Dec 3;11(1):74. doi: 10.1186/s13321-019-0397-9.
6
Molecular Sets (MOSES): A Benchmarking Platform for Molecular Generation Models.分子集(MOSES):分子生成模型的基准测试平台。
Front Pharmacol. 2020 Dec 18;11:565644. doi: 10.3389/fphar.2020.565644. eCollection 2020.
7
Deep learning enables rapid identification of potent DDR1 kinase inhibitors.深度学习可快速鉴定有效的 DDR1 激酶抑制剂。
Nat Biotechnol. 2019 Sep;37(9):1038-1040. doi: 10.1038/s41587-019-0224-x. Epub 2019 Sep 2.
8
GuacaMol: Benchmarking Models for de Novo Molecular Design.GuacaMol:从头设计分子的模型基准测试。
J Chem Inf Model. 2019 Mar 25;59(3):1096-1108. doi: 10.1021/acs.jcim.8b00839. Epub 2019 Mar 19.
9
Exploring the GDB-13 chemical space using deep generative models.使用深度生成模型探索GDB-13化学空间。
J Cheminform. 2019 Mar 12;11(1):20. doi: 10.1186/s13321-019-0341-z.
10
A probabilistic molecular fingerprint for big data settings.一种适用于大数据环境的概率分子指纹。
J Cheminform. 2018 Dec 18;10(1):66. doi: 10.1186/s13321-018-0321-8.