XGraphBoost: Extracting Graph Neural Network-Based Features for a Better Prediction of Molecular Properties.

Affiliations

Fermion Technology Co., Ltd., Guangzhou, Guangdong 510000, P.R. China.

College of Computer Science and Technology, and Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun, Jilin 130012, P.R. China.

Publication information

J Chem Inf Model. 2021 Jun 28;61(6):2697-2705. doi: 10.1021/acs.jcim.0c01489. Epub 2021 May 19.


DOI: 10.1021/acs.jcim.0c01489
PMID: 34009965

Abstract

Determining the properties of chemical molecules is essential for screening candidates similar to a specific drug. These candidate molecules are further evaluated for their target binding affinities, side effects, target missing probabilities, etc. Conventional machine learning algorithms demonstrated satisfying prediction accuracies of molecular properties. A molecule cannot be directly loaded into a machine learning model, and a set of engineered features needs to be designed and calculated from a molecule. Such hand-crafted features rely heavily on the experiences of the investigating researchers. The concept of graph neural networks (GNNs) was recently introduced to describe the chemical molecules. The features may be automatically and objectively extracted from the molecules through various types of GNNs, e.g., GCN (graph convolution network), GGNN (gated graph neural network), DMPNN (directed message passing neural network), etc. However, the training of a stable GNN model requires a huge number of training samples and a large amount of computing power, compared with the conventional machine learning strategies. This study proposed the integrated framework XGraphBoost to extract the features using a GNN and build an accurate prediction model of molecular properties using the classifier XGBoost. The proposed framework XGraphBoost fully inherits the merits of the GNN-based automatic molecular feature extraction and XGBoost-based accurate prediction performance. Both classification and regression problems were evaluated using the framework XGraphBoost. The experimental results strongly suggest that XGraphBoost may facilitate the efficient and accurate predictions of various molecular properties. The source code is freely available to academic users at https://github.com/chenxiaowei-vincent/XGraphBoost.git.
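
The two-stage idea in the abstract (graph-derived features passed to a boosted tree model) can be illustrated with a small standard-library sketch. This is not the authors' implementation: an untrained sum-aggregation message-passing step stands in for a trained GNN, a few gradient-boosted decision stumps stand in for XGBoost, and the molecules and target values (heavy-atom counts) are toy data invented for this illustration. All function names here are hypothetical.

```python
# Sketch only. Assumptions: untrained message passing replaces a trained GNN,
# boosted decision stumps replace XGBoost, and the data below are toy values.

def embed(atom_feats, bonds, rounds=2):
    """Sum neighbour features for a few rounds, then sum-pool over atoms."""
    h = [list(f) for f in atom_feats]
    nbrs = {i: [] for i in range(len(h))}
    for a, b in bonds:
        nbrs[a].append(b)
        nbrs[b].append(a)
    for _ in range(rounds):
        h = [[h[i][k] + sum(h[j][k] for j in nbrs[i]) for k in range(len(h[i]))]
             for i in range(len(h))]
    return [sum(row[k] for row in h) for k in range(len(h[0]))]

def fit_stump(X, residuals):
    """Pick the single-feature threshold split with the lowest squared error."""
    mean = sum(residuals) / len(residuals)
    best = (float("inf"), 0, float("inf"), mean, mean)  # fallback: no split
    for k in range(len(X[0])):
        for t in sorted({x[k] for x in X}):
            left = [r for x, r in zip(X, residuals) if x[k] <= t]
            right = [r for x, r in zip(X, residuals) if x[k] > t]
            if not left or not right:
                continue
            lm, rm = sum(left) / len(left), sum(right) / len(right)
            err = sum((r - lm) ** 2 for r in left) + sum((r - rm) ** 2 for r in right)
            if err < best[0]:
                best = (err, k, t, lm, rm)
    return best[1:]

def fit_boosted(X, y, n_rounds=20, lr=0.5):
    """Gradient boosting for squared error: each stump fits current residuals."""
    base = sum(y) / len(y)
    pred = [base] * len(y)
    stumps = []
    for _ in range(n_rounds):
        residuals = [yi - pi for yi, pi in zip(y, pred)]
        k, t, lm, rm = fit_stump(X, residuals)
        stumps.append((k, t, lm, rm))
        pred = [p + lr * (lm if x[k] <= t else rm) for x, p in zip(X, pred)]
    return base, lr, stumps

def predict(model, x):
    base, lr, stumps = model
    return base + sum(lr * (lm if x[k] <= t else rm) for k, t, lm, rm in stumps)

# One-hot atom types; molecules are heavy-atom graphs (toy data).
C, O, N = [1, 0, 0], [0, 1, 0], [0, 0, 1]
molecules = [
    ([C], []),                        # methane
    ([C, C], [(0, 1)]),               # ethane
    ([C, C, O], [(0, 1), (1, 2)]),    # ethanol
    ([C, C, C], [(0, 1), (1, 2)]),    # propane
    ([C, N], [(0, 1)]),               # methylamine
]
y = [float(len(atoms)) for atoms, _ in molecules]  # toy target: heavy-atom count

X = [embed(atoms, bonds) for atoms, bonds in molecules]  # stage 1: graph features
model = fit_boosted(X, y)                                # stage 2: boosted trees

def mse(preds):
    return sum((p - t) ** 2 for p, t in zip(preds, y)) / len(y)

baseline_mse = mse([sum(y) / len(y)] * len(y))
model_mse = mse([predict(model, x) for x in X])
```

In a realistic setting the embedding would come from a GNN trained on the property of interest, and the second stage would be a real gradient-boosting library; the sketch only shows how the fixed-length graph embedding becomes the feature vector for the tree model.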


