Google Research, Mountain View, CA 94043.
Proc Natl Acad Sci U S A. 2019 Jun 11;116(24):11624-11629. doi: 10.1073/pnas.1820657116. Epub 2019 May 24.
Deep neural networks have achieved state-of-the-art accuracy at classifying molecules with respect to whether they bind to specific protein targets. A key breakthrough would occur if these models could reveal the fragment pharmacophores that are causally involved in binding. Extracting chemical details of binding from the networks could enable scientific discoveries about the mechanisms of drug actions. However, doing so requires shining light into the black box that is the trained neural network model, a task that has proved difficult across many domains. Here we show how the binding mechanism learned by deep neural network models can be interrogated, using a recently described attribution method. We first work with carefully constructed synthetic datasets, in which the molecular features responsible for "binding" are fully known. We find that networks that achieve perfect accuracy on held-out test datasets still learn spurious correlations, and we are able to exploit this nonrobustness to construct adversarial examples that fool the model. This makes these models unreliable for accurately revealing information about the mechanisms of protein-ligand binding. In light of our findings, we prescribe a test that checks whether a hypothesized mechanism can be learned. If the test fails, it indicates that the model must be simplified or regularized and/or that the training dataset requires augmentation.
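For readers unfamiliar with attribution, the following Python sketch illustrates an integrated-gradients-style method (Sundararajan et al., 2017), one plausible instance of the "recently described attribution method" the abstract alludes to. The method name, the function signatures, and the toy gradient below are assumptions for illustration, not the paper's actual implementation.

    # Minimal sketch of integrated-gradients-style attribution (assumed method;
    # the abstract does not name the one used). Given the gradient of a model's
    # output with respect to its input features, attribute the prediction to
    # each feature by averaging gradients along a straight-line path from a
    # baseline to the input, then scaling by the input-baseline difference.
    import numpy as np

    def integrated_gradients(model_grad, x, baseline, steps=50):
        """Riemann approximation of path-integrated gradients."""
        alphas = np.linspace(0.0, 1.0, steps + 1)[1:]  # skip the baseline itself
        grads = np.stack(
            [model_grad(baseline + a * (x - baseline)) for a in alphas]
        )
        return (x - baseline) * grads.mean(axis=0)

    # Toy example: a differentiable "model" f(x) = sum(x**2), so grad f = 2x.
    f_grad = lambda x: 2.0 * x
    x = np.array([1.0, -2.0, 0.5])
    attributions = integrated_gradients(f_grad, x, baseline=np.zeros_like(x))
    # The attributions sum to approximately f(x) - f(baseline), the
    # "completeness" property that makes per-feature attributions meaningful.

In the molecular setting, x would be a featurized molecule (for example, fragment counts), and large attributions flag the fragments the network treats as responsible for predicted binding; comparing those flagged fragments against the known ground-truth features of the synthetic datasets is what exposes the spurious correlations described above.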