A New Fingerprint and Graph Hybrid Neural Network for Predicting Molecular Properties.

Affiliation

College of Physics Science and Technology, Yangzhou University, Jiangsu 225009, China.

Publication information

J Chem Inf Model. 2024 Aug 12;64(15):5853-5866. doi: 10.1021/acs.jcim.4c00586. Epub 2024 Jul 25.

Abstract

Machine learning plays a role in accelerating drug discovery, and the design of effective machine learning models is crucial for accurately predicting molecular properties. Molecules are typically characterized by molecular fingerprints and molecular graphs, which are fed into a multilayer perceptron (MLP) and into variants of graph neural networks such as graph attention networks (GATs), respectively. Because fingerprints come in diverse types and high dimensions, models may contain many features that are relatively irrelevant or redundant. Meanwhile, although the GAT excels at heterogeneous graph tasks, it lacks the ability to extract collaborative information from neighboring nodes, which matters when the joint influence of adjacent groups on an atom must be captured. To overcome these challenges, we introduce a hybrid model that combines an improved GAT with an MLP. In the GAT, a recurrent neural network is employed to capture collaborative information. To address the dimensionality issue, we propose a feature selection algorithm based on the principle of maximizing relevance while minimizing redundancy. In experiments on 13 public data sets and 14 breast cell lines, our model outperforms state-of-the-art deep learning and traditional machine learning algorithms. Additionally, a series of ablation experiments was conducted to demonstrate the advantages of the improved version, as well as its robustness to noise and its interpretability. These results indicate that our model holds promise for practical applications.
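The abstract states only the principle behind the feature selection step (maximize relevance, minimize redundancy) and does not give the algorithm itself. The following is a minimal sketch of a generic greedy selector built on that principle, not the authors' method. It makes several assumptions: absolute Pearson correlation stands in for both the relevance and the redundancy measure (mutual information is a common alternative), the score is the difference between relevance and mean redundancy, and the names `pearson` and `mrmr_select` as well as the toy data are hypothetical.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / math.sqrt(sxx * syy)


def mrmr_select(columns, target, k):
    """Greedily pick up to k column indices by relevance minus mean redundancy.

    Assumption: |Pearson correlation| is used as a stand-in for both measures.
    """
    relevance = [abs(pearson(col, target)) for col in columns]
    selected = []
    remaining = list(range(len(columns)))

    def score(j):
        # The first pick has nothing to be redundant with.
        if not selected:
            return relevance[j]
        redundancy = sum(
            abs(pearson(columns[j], columns[s])) for s in selected
        ) / len(selected)
        return relevance[j] - redundancy

    while remaining and len(selected) < k:
        best = max(remaining, key=score)
        selected.append(best)
        remaining.remove(best)
    return selected


# Hypothetical toy data: four "fingerprint bits" over eight molecules.
a = [1, 1, -1, -1, 1, 1, -1, -1]
b = [1, -1, 1, -1, 1, -1, 1, -1]
columns = [
    a,                               # informative
    [2 * v for v in a],              # redundant: a rescaled copy of column 0
    b,                               # informative and uncorrelated with column 0
    [x * y for x, y in zip(a, b)],   # uncorrelated with the target
]
target = [2 * x + y for x, y in zip(a, b)]

selected = mrmr_select(columns, target, k=2)
print(selected)  # [0, 2]
```

On this toy input the redundant copy (column 1) is skipped in favor of the less relevant but non-redundant column 2, which is the behavior the principle is meant to produce. A selector of this kind is quadratic in the number of features, so for large fingerprint sets the pairwise correlations would normally be cached.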

Similar articles

Predicting miRNA-disease association via graph attention learning and multiplex adaptive modality fusion.

Comput Biol Med. 2024 Feb;169:107904. doi: 10.1016/j.compbiomed.2023.107904. Epub 2023 Dec 28.

FP-GNN: a versatile deep learning architecture for enhanced molecular property prediction.

Brief Bioinform. 2022 Nov 19;23(6). doi: 10.1093/bib/bbac408.

Graph-DTI: A New Model for Drug-target Interaction Prediction Based on Heterogenous Network Graph Embedding.

Curr Comput Aided Drug Des. 2024;20(6):1013-1024. doi: 10.2174/1573409919666230713142255.

Meta Learning With Graph Attention Networks for Low-Data Drug Discovery.

IEEE Trans Neural Netw Learn Syst. 2024 Aug;35(8):11218-11230. doi: 10.1109/TNNLS.2023.3250324. Epub 2024 Aug 5.

Data Integration Using Advances in Machine Learning in Drug Discovery and Molecular Biology.

Methods Mol Biol. 2021;2190:167-184. doi: 10.1007/978-1-0716-0826-5_7.

Fingerprint-Enhanced Graph Attention Network (FinGAT) Model for Antibiotic Discovery.

J Chem Inf Model. 2023 May 22;63(10):2928-2935. doi: 10.1021/acs.jcim.3c00045. Epub 2023 May 11.

Combining handcrafted features with latent variables in machine learning for prediction of radiation-induced lung damage.

Med Phys. 2019 May;46(5):2497-2511. doi: 10.1002/mp.13497. Epub 2019 Apr 8.

Benchmarking Accuracy and Generalizability of Four Graph Neural Networks Using Large In Vitro ADME Datasets from Different Chemical Spaces.

Mol Inform. 2022 Aug;41(8):e2100321. doi: 10.1002/minf.202100321. Epub 2022 Feb 23.

Drug-target affinity prediction with extended graph learning-convolutional networks.

BMC Bioinformatics. 2024 Feb 16;25(1):75. doi: 10.1186/s12859-024-05698-6.

Cited by

The future of pharmaceuticals: Artificial intelligence in drug discovery and development.

J Pharm Anal. 2025 Aug;15(8):101248. doi: 10.1016/j.jpha.2025.101248. Epub 2025 Feb 26.

Enhancing molecular property prediction with quantized GNN models.

J Cheminform. 2025 May 26;17(1):81. doi: 10.1186/s13321-025-00989-3.

Graph-Aware AURALSTM: An Attentive Unified Representation Architecture with BiLSTM for Enhanced Molecular Property Prediction.

Mol Divers. 2025 Apr 25. doi: 10.1007/s11030-025-11197-4.
