• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用简单的卷积神经网络生成强大的分子。

Powerful molecule generation with simple ConvNet.

机构信息

AI Drug Discovery, Anticancer Bioscience Ltd, Chengdu, China.

出版信息

Bioinformatics. 2022 Jun 27;38(13):3438-3443. doi: 10.1093/bioinformatics/btac332.

DOI:10.1093/bioinformatics/btac332
PMID:35595245
Abstract

MOTIVATION

Automated molecule generation is a crucial step in in-silico drug discovery. Graph-based generation algorithms have seen significant progress over recent years. However, they are often complex to implement, hard to train and can under-perform when generating long-sequence molecules. The development of a simple and powerful alternative can help improve practicality of automated drug discovery method.

RESULTS

We proposed a ConvNet-based sequential graph generation algorithm. The molecular graph generation problem is reformulated as a sequence of simple classification tasks. At each step, a convolutional neural network operates on a sub-graph that is generated at previous step, and predicts/classifies an atom/bond adding action to populate the input sub-graph. The proposed model is pretrained by learning to sequentially reconstruct existing molecules. The pretrained model is abbreviated as SEEM (structural encoder for engineering molecules). It is then fine-tuned with reinforcement learning to generate molecules with improved properties. The fine-tuned model is named SEED (structural encoder for engineering drug-like-molecules). The proposed models have demonstrated competitive performance comparing to 16 state-of-the-art baselines on three benchmark datasets.

AVAILABILITY AND IMPLEMENTATION

Code is available at https://github.com/yuh8/SEEM and https://github.com/yuh8/SEED. QM9 dataset is availble at http://quantum-machine.org/datasets/, ZINC250k dataset is availble at https://raw.githubusercontent.com/aspuru-guzik-group/chemical_vae/master/models/zinc_properties/250k_rndm_zinc_drugs_clean_3.csv, and ChEMBL dataset is availble at https://www.ebi.ac.uk/chembl/.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

自动化分子生成是计算机药物发现中的关键步骤。基于图的生成算法近年来取得了重大进展。然而,它们通常实现起来很复杂,难以训练,并且在生成长序列分子时性能可能会下降。开发一种简单而强大的替代方案可以帮助提高自动化药物发现方法的实用性。

结果

我们提出了一种基于卷积神经网络的序贯图生成算法。分子图生成问题被重新表述为一系列简单的分类任务。在每一步,一个卷积神经网络对在前一步生成的子图进行操作,并预测/分类添加原子/键的操作,以填充输入子图。所提出的模型通过学习顺序重建现有分子进行预训练。预训练模型缩写为 SEEM(用于工程分子的结构编码器)。然后使用强化学习对其进行微调,以生成具有改进性质的分子。微调后的模型命名为 SEED(用于工程类药物分子的结构编码器)。所提出的模型在三个基准数据集上与 16 种最先进的基线进行比较,表现出了有竞争力的性能。

可用性和实现

代码可在 https://github.com/yuh8/SEEM 和 https://github.com/yuh8/SEED 上获得。QM9 数据集可在 http://quantum-machine.org/datasets/ 获得,ZINC250k 数据集可在 https://raw.githubusercontent.com/aspuru-guzik-group/chemical_vae/master/models/zinc_properties/250k_rndm_zinc_drugs_clean_3.csv 获得,而 ChEMBL 数据集可在 https://www.ebi.ac.uk/chembl/ 获得。

补充信息

补充数据可在 Bioinformatics 在线获得。

相似文献

1
Powerful molecule generation with simple ConvNet.用简单的卷积神经网络生成强大的分子。
Bioinformatics. 2022 Jun 27;38(13):3438-3443. doi: 10.1093/bioinformatics/btac332.
2
Molecular property prediction based on graph structure learning.基于图结构学习的分子性质预测。
Bioinformatics. 2024 May 2;40(5). doi: 10.1093/bioinformatics/btae304.
3
Network-principled deep generative models for designing drug combinations as graph sets.基于网络原理的深度生成模型,用于将药物组合设计为图集合。
Bioinformatics. 2020 Jul 1;36(Suppl_1):i445-i454. doi: 10.1093/bioinformatics/btaa317.
4
Conformer-RL: A deep reinforcement learning library for conformer generation.Conformer-RL:用于 Conformer 生成的深度强化学习库。
J Comput Chem. 2022 Oct 15;43(27):1880-1886. doi: 10.1002/jcc.26984. Epub 2022 Aug 24.
5
Structure-aware protein self-supervised learning.基于结构的蛋白质自监督学习。
Bioinformatics. 2023 Apr 3;39(4). doi: 10.1093/bioinformatics/btad189.
6
HyperAttentionDTI: improving drug-protein interaction prediction by sequence-based deep learning with attention mechanism.超注意力 DTI:基于注意力机制的序列深度学习提高药物-蛋白相互作用预测
Bioinformatics. 2022 Jan 12;38(3):655-662. doi: 10.1093/bioinformatics/btab715.
7
MultiDTI: drug-target interaction prediction based on multi-modal representation learning to bridge the gap between new chemical entities and known heterogeneous network.多模态 DTI:基于多模态表示学习的药物-靶点相互作用预测,以弥合新型化学实体与已知异构网络之间的差距。
Bioinformatics. 2021 Dec 7;37(23):4485-4492. doi: 10.1093/bioinformatics/btab473.
8
CMMS-GCL: cross-modality metabolic stability prediction with graph contrastive learning.CMMS-GCL:基于图对比学习的跨模态代谢稳定性预测。
Bioinformatics. 2023 Aug 1;39(8). doi: 10.1093/bioinformatics/btad503.
9
Hyperbolic relational graph convolution networks plus: a simple but highly efficient QSAR-modeling method.双曲关系图卷积网络加:一种简单但高效的 QSAR 建模方法。
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab112.
10
De novo drug design by iterative multiobjective deep reinforcement learning with graph-based molecular quality assessment.基于图的分子质量评估的迭代多目标深度强化学习的从头药物设计。
Bioinformatics. 2023 Apr 3;39(4). doi: 10.1093/bioinformatics/btad157.

引用本文的文献

1
3D-EDiffMG: 3D equivariant diffusion-driven molecular generation to accelerate drug discovery.3D-EDiffMG:用于加速药物发现的3D等变扩散驱动分子生成
J Pharm Anal. 2025 Jun;15(6):101257. doi: 10.1016/j.jpha.2025.101257. Epub 2025 Mar 5.
2
MTGGF: A Metabolism Type-Aware Graph Generative Model for Molecular Metabolite Prediction.MTGGF:一种用于分子代谢物预测的代谢类型感知图生成模型。
Interdiscip Sci. 2025 Jan 6. doi: 10.1007/s12539-024-00681-4.