简化可解释的图卷积神经网络用于小分子活性预测。

Simplified, interpretable graph convolutional neural networks for small molecule activity prediction.

机构信息

IBM Thomas J Watson Research Center, Yorktown Heights, NY, USA.

出版信息

J Comput Aided Mol Des. 2022 May;36(5):391-404. doi: 10.1007/s10822-021-00421-6. Epub 2021 Nov 24.

DOI:10.1007/s10822-021-00421-6

PMID:34817762

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9325818/

Abstract

We here present a streamlined, explainable graph convolutional neural network (gCNN) architecture for small molecule activity prediction. We first conduct a hyperparameter optimization across nearly 800 protein targets that produces a simplified gCNN QSAR architecture, and we observe that such a model can yield performance improvements over both standard gCNN and RF methods on difficult-to-classify test sets. Additionally, we discuss how reductions in convolutional layer dimensions potentially speak to the "anatomical" needs of gCNNs with respect to radial coarse graining of molecular substructure. We augment this simplified architecture with saliency map technology that highlights molecular substructures relevant to activity, and we perform saliency analysis on nearly 100 data-rich protein targets. We show that resultant substructural clusters are useful visualization tools for understanding substructure-activity relationships. We go on to highlight connections between our models' saliency predictions and observations made in the medicinal chemistry literature, focusing on four case studies of past lead finding and lead optimization campaigns.

摘要

我们在这里提出了一种简化的、可解释的图卷积神经网络（gCNN）架构，用于小分子活性预测。我们首先在近 800 个蛋白质靶标上进行了超参数优化，得到了一个简化的 gCNN-QSAR 架构，我们观察到，与标准的 gCNN 和 RF 方法相比，该模型在难以分类的测试集上可以提高性能。此外，我们讨论了卷积层维度的减少如何可能与 gCNN 相对于分子亚结构的径向粗粒化的“解剖”需求有关。我们使用显着性映射技术来增强这个简化的架构，突出与活性相关的分子亚结构，并对近 100 个数据丰富的蛋白质靶标进行显着性分析。我们表明，所得的亚结构聚类是用于理解亚结构-活性关系的有用可视化工具。我们继续强调我们的模型的显着性预测与药物化学文献中的观察结果之间的联系，重点介绍过去的先导发现和先导优化活动的四个案例研究。

相似文献

Simplified, interpretable graph convolutional neural networks for small molecule activity prediction.简化可解释的图卷积神经网络用于小分子活性预测。

J Comput Aided Mol Des. 2022 May;36(5):391-404. doi: 10.1007/s10822-021-00421-6. Epub 2021 Nov 24.

MutagenPred-GCNNs: A Graph Convolutional Neural Network-Based Classification Model for Mutagenicity Prediction with Data-Driven Molecular Fingerprints.MutagenPred-GCNNs：一种基于图卷积神经网络的分类模型，用于使用数据驱动的分子指纹进行致突变性预测。

Interdiscip Sci. 2021 Mar;13(1):25-33. doi: 10.1007/s12539-020-00407-2. Epub 2021 Jan 27.

Stable feature selection utilizing Graph Convolutional Neural Network and Layer-wise Relevance Propagation for biomarker discovery in breast cancer.利用图卷积神经网络和逐层相关性传播进行稳定特征选择，以发现乳腺癌的生物标志物。

Artif Intell Med. 2024 May;151:102840. doi: 10.1016/j.artmed.2024.102840. Epub 2024 Mar 11.

Prediction and interpretation of cancer survival using graph convolution neural networks.基于图卷积神经网络的癌症生存预测和解释。

Methods. 2021 Aug;192:120-130. doi: 10.1016/j.ymeth.2021.01.004. Epub 2021 Jan 21.

Essential genes identification model based on sequence feature map and graph convolutional neural network.基于序列特征图和图卷积神经网络的必需基因识别模型。

BMC Genomics. 2024 Jan 10;25(1):47. doi: 10.1186/s12864-024-09958-w.

Machine Learning with Enormous "Synthetic" Data Sets: Predicting Glass Transition Temperature of Polyimides Using Graph Convolutional Neural Networks.利用大量“合成”数据集进行机器学习：使用图卷积神经网络预测聚酰亚胺的玻璃化转变温度

ACS Omega. 2022 Nov 17;7(48):43678-43691. doi: 10.1021/acsomega.2c04649. eCollection 2022 Dec 6.

Learning graph in graph convolutional neural networks for robust seizure prediction.在图卷积神经网络中学习图以进行鲁棒性癫痫发作预测。

J Neural Eng. 2020 Jun 22;17(3):035004. doi: 10.1088/1741-2552/ab909d.

Graph Convolutional Neural Networks as "General-Purpose" Property Predictors: The Universality and Limits of Applicability.图卷积神经网络作为“通用”属性预测器：适用性的普遍性和局限性。

J Chem Inf Model. 2020 Jan 27;60(1):22-28. doi: 10.1021/acs.jcim.9b00587. Epub 2020 Jan 3.

Interpretable-ADMET: a web service for ADMET prediction and optimization based on deep neural representation.可解释的 ADMET：基于深度神经表示的 ADMET 预测和优化的网络服务。

Bioinformatics. 2022 May 13;38(10):2863-2871. doi: 10.1093/bioinformatics/btac192.

Graph-Convolutional Neural Net Model of the Statistical Torsion Profiles for Small Organic Molecules.小分子统计扭曲构象的图卷积神经网络模型。

J Chem Inf Model. 2022 Dec 12;62(23):5896-5906. doi: 10.1021/acs.jcim.2c00790. Epub 2022 Dec 1.

引用本文的文献

I‑GAT: Interpretable Graph Attention Networks for Ligand Optimization.I‑GAT：用于配体优化的可解释图注意力网络

ACS Omega. 2025 Jul 21;10(30):32968-32986. doi: 10.1021/acsomega.5c02173. eCollection 2025 Aug 5.

AI-Based Computational Methods in Early Drug Discovery and Post Market Drug Assessment: A Survey.早期药物发现与上市后药物评估中基于人工智能的计算方法：一项综述。

IEEE Trans Comput Biol Bioinform. 2025 Jan-Feb;22(1):97-115. doi: 10.1109/TCBB.2024.3492708.

A Comprehensive Investigation: Developing the Pharmaceutical Industry through Artificial Intelligence.一项全面调查：通过人工智能发展制药行业

Curr Drug Discov Technol. 2024 Sep 5. doi: 10.2174/0115701638313233240830132804.

Topological regression as an interpretable and efficient tool for quantitative structure-activity relationship modeling.拓扑回归作为一种用于定量构效关系建模的可解释且高效的工具。

Nat Commun. 2024 Jun 13;15(1):5072. doi: 10.1038/s41467-024-49372-0.

Enhancing property and activity prediction and interpretation using multiple molecular graph representations with MMGX.使用MMGX通过多种分子图表示增强性质和活性预测及解释。

Commun Chem. 2024 Apr 5;7(1):74. doi: 10.1038/s42004-024-01155-w.

Advancing material property prediction: using physics-informed machine learning models for viscosity.推进材料性能预测：使用物理信息机器学习模型预测粘度。

J Cheminform. 2024 Mar 14;16(1):31. doi: 10.1186/s13321-024-00820-5.

Cheminformatics and artificial intelligence for accelerating agrochemical discovery.用于加速农用化学品发现的化学信息学与人工智能

Front Chem. 2023 Nov 29;11:1292027. doi: 10.3389/fchem.2023.1292027. eCollection 2023.

Extended study on atomic featurization in graph neural networks for molecular property prediction.用于分子性质预测的图神经网络中原子特征化的扩展研究。

J Cheminform. 2023 Sep 19;15(1):81. doi: 10.1186/s13321-023-00751-7.

Molecular Property Prediction by Combining LSTM and GAT.基于 LSTM 和 GAT 的分子性质预测。

Biomolecules. 2023 Mar 9;13(3):503. doi: 10.3390/biom13030503.

A Perspective on Explanations of Molecular Prediction Models.分子预测模型解释的透视。

J Chem Theory Comput. 2023 Apr 25;19(8):2149-2160. doi: 10.1021/acs.jctc.2c01235. Epub 2023 Mar 27.

本文引用的文献

A self-attention based message passing neural network for predicting molecular lipophilicity and aqueous solubility.一种基于自注意力的消息传递神经网络，用于预测分子亲脂性和水溶性。

J Cheminform. 2020 Feb 21;12(1):15. doi: 10.1186/s13321-020-0414-z.

The Literature of Chemoinformatics: 1978-2018.《化学生信学文献：1978-2018》。

Int J Mol Sci. 2020 Aug 4;21(15):5576. doi: 10.3390/ijms21155576.

QSAR without borders.无边界定量构效关系。

Chem Soc Rev. 2020 Jun 7;49(11):3525-3564. doi: 10.1039/d0cs00098a. Epub 2020 May 1.

A Deep Learning Approach to Antibiotic Discovery.深度学习在抗生素发现中的应用。

Cell. 2020 Feb 20;180(4):688-702.e13. doi: 10.1016/j.cell.2020.01.021.

Combining Docking Pose Rank and Structure with Deep Learning Improves Protein-Ligand Binding Mode Prediction over a Baseline Docking Approach.结合对接构象排序和深度学习可提高基于对接方法的蛋白-配体结合模式预测。

J Chem Inf Model. 2020 Sep 28;60(9):4170-4179. doi: 10.1021/acs.jcim.9b00927. Epub 2020 Mar 3.

All-Assay-Max2 pQSAR: Activity Predictions as Accurate as Four-Concentration ICs for 8558 Novartis Assays.All-Assay-Max2 pQSAR：对 8558 项诺华测定法的活性预测，其准确性可媲美四浓度 ICs。

J Chem Inf Model. 2019 Oct 28;59(10):4450-4459. doi: 10.1021/acs.jcim.9b00375. Epub 2019 Sep 26.

Pushing the Boundaries of Molecular Representation for Drug Discovery with the Graph Attention Mechanism.利用图注意力机制拓展药物发现中分子表示的边界。

J Med Chem. 2020 Aug 27;63(16):8749-8760. doi: 10.1021/acs.jmedchem.9b00959. Epub 2019 Aug 27.

Graph dynamical networks for unsupervised learning of atomic scale dynamics in materials.用于材料中原子尺度动力学无监督学习的图动态网络。

Nat Commun. 2019 Jun 17;10(1):2667. doi: 10.1038/s41467-019-10663-6.

Using attribution to decode binding mechanism in neural network models for chemistry.基于归因解码神经网络模型在化学中的结合机制。

Proc Natl Acad Sci U S A. 2019 Jun 11;116(24):11624-11629. doi: 10.1073/pnas.1820657116. Epub 2019 May 24.

A graph-based genetic algorithm and generative model/Monte Carlo tree search for the exploration of chemical space.一种基于图的遗传算法和生成模型/蒙特卡罗树搜索方法用于化学空间探索。

Chem Sci. 2019 Feb 11;10(12):3567-3572. doi: 10.1039/c8sc05372c. eCollection 2019 Mar 28.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

简化可解释的图卷积神经网络用于小分子活性预测。

Simplified, interpretable graph convolutional neural networks for small molecule activity prediction.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献