• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用变压器模型进行详尽的局部化学空间探索。

Exhaustive local chemical space exploration using a transformer model.

作者信息

Tibo Alessandro, He Jiazhen, Janet Jon Paul, Nittinger Eva, Engkvist Ola

机构信息

Molecular AI, Discovery Sciences, R&D, AstraZeneca, Gothenburg, Sweden.

Medicinal Chemistry, Research and Early Development, Respiratory and Immunology (R&I), BioPharmaceuticals R&D AstraZeneca, Gothenburg, Sweden.

出版信息

Nat Commun. 2024 Aug 25;15(1):7315. doi: 10.1038/s41467-024-51672-4.

DOI:10.1038/s41467-024-51672-4
PMID:39183239
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11345417/
Abstract

How many near-neighbors does a molecule have? This fundamental question in chemistry is crucial for molecular optimization problems under the similarity principle assumption. Generative models can sample molecules from a vast chemical space but lack explicit knowledge about molecular similarity. Therefore, these models need guidance from reinforcement learning to sample a relevant similar chemical space. However, they still miss a mechanism to measure the coverage of a specific region of the chemical space. To overcome these limitations, a source-target molecular transformer model, regularized via a similarity kernel function, is proposed. Trained on a largest dataset of ≥200 billion molecular pairs, the model enforces a direct relationship between generating a target molecule and its similarity to a source molecule. Results indicate that the regularization term significantly improves the correlation between generation probability and molecular similarity, enabling exhaustive exploration of molecule near-neighborhoods.

摘要

一个分子有多少近邻?化学中的这个基本问题对于相似性原理假设下的分子优化问题至关重要。生成模型可以从广阔的化学空间中采样分子,但缺乏关于分子相似性的明确知识。因此,这些模型需要强化学习的指导来采样相关的相似化学空间。然而,它们仍然缺少一种机制来测量化学空间特定区域的覆盖范围。为了克服这些限制,提出了一种通过相似性核函数进行正则化的源 - 目标分子变压器模型。该模型在一个≥2000亿对分子的最大数据集上进行训练,在生成目标分子与其与源分子的相似性之间建立了直接关系。结果表明,正则化项显著提高了生成概率与分子相似性之间的相关性,能够对分子近邻进行详尽探索。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b5c/11345417/de309968eefb/41467_2024_51672_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b5c/11345417/f447e144ca04/41467_2024_51672_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b5c/11345417/a51f5f128ebb/41467_2024_51672_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b5c/11345417/cee7f6b8af61/41467_2024_51672_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b5c/11345417/71ec3708815e/41467_2024_51672_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b5c/11345417/a945c944b407/41467_2024_51672_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b5c/11345417/25fcaef12ab8/41467_2024_51672_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b5c/11345417/de309968eefb/41467_2024_51672_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b5c/11345417/f447e144ca04/41467_2024_51672_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b5c/11345417/a51f5f128ebb/41467_2024_51672_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b5c/11345417/cee7f6b8af61/41467_2024_51672_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b5c/11345417/71ec3708815e/41467_2024_51672_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b5c/11345417/a945c944b407/41467_2024_51672_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b5c/11345417/25fcaef12ab8/41467_2024_51672_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b5c/11345417/de309968eefb/41467_2024_51672_Fig7_HTML.jpg

相似文献

1
Exhaustive local chemical space exploration using a transformer model.使用变压器模型进行详尽的局部化学空间探索。
Nat Commun. 2024 Aug 25;15(1):7315. doi: 10.1038/s41467-024-51672-4.
2
Evaluation of reinforcement learning in transformer-based molecular design.基于Transformer的分子设计中强化学习的评估
J Cheminform. 2024 Aug 8;16(1):95. doi: 10.1186/s13321-024-00887-0.
3
cMolGPT: A Conditional Generative Pre-Trained Transformer for Target-Specific De Novo Molecular Generation.cMolGPT:一种用于靶向特定从头分子生成的条件生成式预训练转换器。
Molecules. 2023 May 30;28(11):4430. doi: 10.3390/molecules28114430.
4
Optimization of binding affinities in chemical space with generative pre-trained transformer and deep reinforcement learning.利用生成式预训练变换器和深度强化学习在化学空间中优化结合亲和力
F1000Res. 2024 Feb 20;12:757. doi: 10.12688/f1000research.130936.2. eCollection 2023.
5
Transformer-based molecular optimization beyond matched molecular pairs.超越匹配分子对的基于Transformer的分子优化。
J Cheminform. 2022 Mar 28;14(1):18. doi: 10.1186/s13321-022-00599-3.
6
Generative Pre-trained Transformer (GPT) based model with relative attention for de novo drug design.基于生成式预训练转换器(GPT)的相对注意力模型在从头设计药物中的应用。
Comput Biol Chem. 2023 Oct;106:107911. doi: 10.1016/j.compbiolchem.2023.107911. Epub 2023 Jun 29.
7
Efficient virtual high-content screening using a distance-aware transformer model.使用距离感知变压器模型进行高效虚拟高内涵筛选。
J Cheminform. 2023 Feb 8;15(1):18. doi: 10.1186/s13321-023-00686-z.
8
Interpretable Machine Learning Models for Molecular Design of Tyrosine Kinase Inhibitors Using Variational Autoencoders and Perturbation-Based Approach of Chemical Space Exploration.基于变分自动编码器和基于扰动的化学空间探索方法的酪氨酸激酶抑制剂分子设计可解释机器学习模型。
Int J Mol Sci. 2022 Sep 24;23(19):11262. doi: 10.3390/ijms231911262.
9
Explore drug-like space with deep generative models.使用深度生成模型探索类药物空间。
Methods. 2023 Feb;210:52-59. doi: 10.1016/j.ymeth.2023.01.004. Epub 2023 Jan 19.
10
Optimization of Molecules via Deep Reinforcement Learning.通过深度强化学习优化分子。
Sci Rep. 2019 Jul 24;9(1):10752. doi: 10.1038/s41598-019-47148-x.

引用本文的文献

1
RSGPT: a generative transformer model for retrosynthesis planning pre-trained on ten billion datapoints.RSGPT:一种基于一百亿数据点进行预训练的用于逆合成规划的生成式变压器模型。
Nat Commun. 2025 Jul 31;16(1):7012. doi: 10.1038/s41467-025-62308-6.
2
Recent advances in AI-based toxicity prediction for drug discovery.基于人工智能的药物发现毒性预测的最新进展。
Front Chem. 2025 Jul 8;13:1632046. doi: 10.3389/fchem.2025.1632046. eCollection 2025.
3
PepINVENT: generative peptide design beyond natural amino acids.PepINVENT:超越天然氨基酸的生成性肽设计。

本文引用的文献

1
Artificial intelligence in molecular de novo design: Integration with experiment.人工智能在分子从头设计中的应用:与实验的结合。
Curr Opin Struct Biol. 2023 Jun;80:102575. doi: 10.1016/j.sbi.2023.102575. Epub 2023 Mar 24.
2
PubChem 2023 update.PubChem 2023 更新。
Nucleic Acids Res. 2023 Jan 6;51(D1):D1373-D1380. doi: 10.1093/nar/gkac956.
3
Designing optimized drug candidates with Generative Adversarial Network.利用生成对抗网络设计优化的候选药物。
Chem Sci. 2025 Apr 16;16(20):8682-8696. doi: 10.1039/d4sc07642g. eCollection 2025 May 21.
4
Molecular similarity: Theory, applications, and perspectives.分子相似性:理论、应用与展望。
Artif Intell Chem. 2024 Dec;2(2). doi: 10.1016/j.aichem.2024.100077. Epub 2024 Aug 31.
5
Unveiling the power of language models in chemical research question answering.揭示语言模型在化学研究问题解答中的力量。
Commun Chem. 2025 Jan 5;8(1):4. doi: 10.1038/s42004-024-01394-x.
6
Alternative weighting schemes for fine-tuned extended similarity indices.微调扩展相似性指数的替代加权方案。
J Chemom. 2024 Sep;38(9). doi: 10.1002/cem.3558. Epub 2024 May 11.
7
Reinvent 4: Modern AI-driven generative molecule design.重塑4:现代人工智能驱动的生成式分子设计。
J Cheminform. 2024 Feb 21;16(1):20. doi: 10.1186/s13321-024-00812-5.
J Cheminform. 2022 Jun 26;14(1):40. doi: 10.1186/s13321-022-00623-6.
4
Transformer-based molecular optimization beyond matched molecular pairs.超越匹配分子对的基于Transformer的分子优化。
J Cheminform. 2022 Mar 28;14(1):18. doi: 10.1186/s13321-022-00599-3.
5
Therapeutic target database update 2022: facilitating drug discovery with enriched comparative data of targeted agents.治疗靶点数据库更新 2022:通过丰富的靶向药物比较数据促进药物发现。
Nucleic Acids Res. 2022 Jan 7;50(D1):D1398-D1407. doi: 10.1093/nar/gkab953.
6
Molecular optimization by capturing chemist's intuition using deep neural networks.通过使用深度神经网络捕捉化学家的直觉进行分子优化。
J Cheminform. 2021 Mar 20;13(1):26. doi: 10.1186/s13321-021-00497-0.
7
REINVENT 2.0: An AI Tool for De Novo Drug Design.REINVENT 2.0:一种用于从头设计药物的人工智能工具。
J Chem Inf Model. 2020 Dec 28;60(12):5918-5922. doi: 10.1021/acs.jcim.0c00915. Epub 2020 Oct 29.
8
Retrospective on a decade of machine learning for chemical discovery.化学发现领域机器学习十年回顾。
Nat Commun. 2020 Sep 29;11(1):4895. doi: 10.1038/s41467-020-18556-9.
9
Combining Cloud-Based Free-Energy Calculations, Synthetically Aware Enumerations, and Goal-Directed Generative Machine Learning for Rapid Large-Scale Chemical Exploration and Optimization.结合基于云的自由能计算、综合感知枚举和目标导向的生成式机器学习,实现快速大规模化学探索和优化。
J Chem Inf Model. 2020 Sep 28;60(9):4311-4325. doi: 10.1021/acs.jcim.0c00120. Epub 2020 Jun 19.
10
Bidirectional Molecule Generation with Recurrent Neural Networks.双向分子生成的递归神经网络。
J Chem Inf Model. 2020 Mar 23;60(3):1175-1183. doi: 10.1021/acs.jcim.9b00943. Epub 2020 Jan 16.