AweGNN: Auto-parametrized weighted element-specific graph neural networks for molecules.

Affiliations

Department of Mathematics, Michigan State University, MI, 48824, USA.

Department of Mathematics, University of Kentucky, KY, 40506, USA.

Publication information

Comput Biol Med. 2021 Jul;134:104460. doi: 10.1016/j.compbiomed.2021.104460. Epub 2021 May 12.

DOI:10.1016/j.compbiomed.2021.104460

PMID:34020133

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC8263495/

Abstract

While automated feature extraction has had tremendous success in many deep learning algorithms for image analysis and natural language processing, it does not work well for data involving complex internal structures, such as molecules. Data representations via advanced mathematics, including algebraic topology, differential geometry, and graph theory, have demonstrated superiority in a variety of biomolecular applications; however, their performance often depends on manual parametrization. This work introduces the auto-parametrized weighted element-specific graph neural network, dubbed AweGNN, which overcomes this tedious parametrization process and also serves as a suitable technique for automated feature extraction on these internally complex biomolecular data sets. The AweGNN is a neural network model based on geometric-graph features of element-pair interactions, with its graph parameters being updated throughout the training, which results in what we call a network-enabled automatic representation (NEAR). To enhance the predictions with small data sets, we construct multi-task (MT) AweGNN models in addition to single-task (ST) AweGNN models. The proposed methods are applied to various benchmark data sets, including four data sets for quantitative toxicity analysis and another data set for solvation prediction. Extensive numerical tests show that AweGNN models can achieve state-of-the-art performance in molecular property predictions.
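
The abstract does not specify the kernel, the feature definition, or the update rule, so the following is only a minimal sketch of the general idea of an element-pair feature whose graph parameter is adjusted by gradient descent instead of being tuned by hand. The generalized exponential kernel, the toy squared-error objective, and all names (`kernel`, `pair_feature`, `pair_feature_grad_eta`, `fit_eta`) are assumptions made for illustration and are not taken from the paper.

```python
import math

def kernel(r, eta, kappa):
    """Assumed kernel form exp(-(r/eta)**kappa); not taken from the paper."""
    return math.exp(-((r / eta) ** kappa))

def pair_feature(coords_a, coords_b, eta, kappa):
    """Sum of kernel weights over all atom pairs between two element groups."""
    return sum(kernel(math.dist(p, q), eta, kappa)
               for p in coords_a for q in coords_b)

def pair_feature_grad_eta(coords_a, coords_b, eta, kappa):
    """Analytic derivative of pair_feature with respect to eta."""
    total = 0.0
    for p in coords_a:
        for q in coords_b:
            u = (math.dist(p, q) / eta) ** kappa
            # d/d(eta) exp(-u) = exp(-u) * kappa * u / eta
            total += math.exp(-u) * kappa * u / eta
    return total

def fit_eta(coords_a, coords_b, target, eta=1.0, kappa=2.0, lr=0.2, steps=500):
    """Toy training loop: gradient descent on (feature - target)**2 over eta."""
    for _ in range(steps):
        err = pair_feature(coords_a, coords_b, eta, kappa) - target
        grad = 2.0 * err * pair_feature_grad_eta(coords_a, coords_b, eta, kappa)
        eta -= lr * grad
    return eta

# Hypothetical coordinates (in angstroms) for a carbon group and an oxygen group.
carbons = [(0.0, 0.0, 0.0), (1.5, 0.0, 0.0)]
oxygens = [(0.0, 1.2, 0.0)]

# Generate a target with a known eta, then recover it starting from eta = 1.0.
target = pair_feature(carbons, oxygens, eta=2.0, kappa=2.0)
learned_eta = fit_eta(carbons, oxygens, target)
```

In a full model the same gradient would presumably flow from the network loss through the features by backpropagation, with one such parameter set per element pair; here a single scalar objective stands in for that loss.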

Cited by

SS-GNN: A Simple-Structured Graph Neural Network for Affinity Prediction.

ACS Omega. 2023 Jun 15;8(25):22496-22507. doi: 10.1021/acsomega.3c00085. eCollection 2023 Jun 27.
