Revealing cytotoxic substructures in molecules using deep learning.

Affiliations

In silico Toxicology and Structural Bioinformatics, Institute of Physiology, Charité-Universitätsmedizin Berlin, Charitéplatz 1, 10117, Berlin, Germany.

Leibniz-Forschungsinstitut für Molekulare Pharmakologie (FMP), Robert-Roessle Strasse 10, 13125, Berlin, Germany.

Publication information

J Comput Aided Mol Des. 2020 Jul;34(7):731-746. doi: 10.1007/s10822-020-00310-4. Epub 2020 Apr 16.

DOI: 10.1007/s10822-020-00310-4

PMID: 32297073

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7292813/

Abstract

In drug development, late-stage toxicity issues of a compound are the main cause of failure in clinical trials. In silico methods are therefore of high importance to guide the early design process and to reduce time, costs and animal testing. Technical advances and the ever-growing amount of available toxicity data have enabled machine learning, especially neural networks, to impact the field of predictive toxicology. In this study, cytotoxicity prediction, one of the earliest handles in drug discovery, is investigated using a deep learning approach trained on a highly consistent in-house data set of over 34,000 compounds, of which less than 5% are cytotoxic. The model reached a balanced accuracy of over 70%, similar to previously reported studies using Random Forest. Although they yield good results, neural networks are often described as black boxes that offer little mechanistic insight into the underlying model. To overcome this lack of interpretability, a Deep Taylor Decomposition method is investigated to identify substructures that may be responsible for the cytotoxic effects, the so-called toxicophores. Furthermore, this study introduces cytotoxicity maps, which provide a visual structural interpretation of the relevance of these substructures. This approach could be helpful in drug development to predict the potential toxicity of a compound and to generate new insights into the toxic mechanism. Moreover, it could also help to de-risk and optimize compounds.
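The abstract reports balanced accuracy rather than plain accuracy, which matters when fewer than 5% of the compounds are cytotoxic. The minimal Python sketch below illustrates why. It uses hypothetical toy labels (not data from the study) and helper names `accuracy` and `balanced_accuracy` chosen here for illustration, and it assumes the usual binary definition of balanced accuracy as the mean of sensitivity and specificity.

```python
def accuracy(y_true, y_pred):
    """Fraction of compounds whose predicted label matches the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def balanced_accuracy(y_true, y_pred):
    """Mean of sensitivity and specificity for binary labels (1 = cytotoxic)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # toxic, flagged
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # toxic, missed
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # non-toxic, cleared
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # non-toxic, flagged
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("both classes must be present in y_true")
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return (sensitivity + specificity) / 2


# Hypothetical toy set: 1000 compounds, 5% cytotoxic (illustrative only).
y_true = [1] * 50 + [0] * 950

# A trivial "model" that labels every compound as non-toxic.
always_nontoxic = [0] * 1000

# A hypothetical classifier: finds 30 of the 50 toxic compounds and
# wrongly flags 190 of the 950 non-toxic ones.
y_pred = [1] * 30 + [0] * 20 + [1] * 190 + [0] * 760

for name, pred in [("always non-toxic", always_nontoxic), ("classifier", y_pred)]:
    print(f"{name}: accuracy={accuracy(y_true, pred):.2f}, "
          f"balanced accuracy={balanced_accuracy(y_true, pred):.2f}")
```

On this toy set the trivial predictor scores higher on plain accuracy (0.95 versus 0.79) yet only 0.50 on balanced accuracy, while the hypothetical classifier reaches 0.70. This is why balanced accuracy is the more informative figure for a data set dominated by non-toxic compounds.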


Figure 1: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a01/7292813/10fe0552a456/10822_2020_310_Fig1_HTML.jpg

Similar articles

Data Integration Using Advances in Machine Learning in Drug Discovery and Molecular Biology.

Methods Mol Biol. 2021;2190:167-184. doi: 10.1007/978-1-0716-0826-5_7.

DeepSIBA: chemical structure-based inference of biological alterations using deep learning.

Mol Omics. 2021 Feb 1;17(1):108-120. doi: 10.1039/d0mo00129e. Epub 2020 Nov 14.

The power of deep learning to ligand-based novel drug discovery.

Expert Opin Drug Discov. 2020 Jul;15(7):755-764. doi: 10.1080/17460441.2020.1745183. Epub 2020 Mar 31.

Comparative study between deep learning and QSAR classifications for TNBC inhibitors and novel GPCR agonist discovery.

Sci Rep. 2020 Oct 8;10(1):16771. doi: 10.1038/s41598-020-73681-1.

A compact review of progress and prospects of deep learning in drug discovery.

J Mol Model. 2023 Mar 28;29(4):117. doi: 10.1007/s00894-023-05492-w.

Deep Learning in Virtual Screening: Recent Applications and Developments.

Int J Mol Sci. 2021 Apr 23;22(9):4435. doi: 10.3390/ijms22094435.

Diversifying chemical libraries with generative topographic mapping.

J Comput Aided Mol Des. 2020 Jul;34(7):805-815. doi: 10.1007/s10822-019-00215-x. Epub 2019 Aug 12.

A renaissance of neural networks in drug discovery.

Expert Opin Drug Discov. 2016 Aug;11(8):785-95. doi: 10.1080/17460441.2016.1201262. Epub 2016 Jul 4.

Reinforced Adversarial Neural Computer for de Novo Molecular Design.

J Chem Inf Model. 2018 Jun 25;58(6):1194-1204. doi: 10.1021/acs.jcim.7b00690. Epub 2018 Jun 12.

Cited by

Explainable Artificial Intelligence in the Field of Drug Research.

Drug Des Devel Ther. 2025 May 29;19:4501-4516. doi: 10.2147/DDDT.S525171. eCollection 2025.

Establishment of interpretable cytotoxicity prediction models using machine learning analysis of transcriptome features.

Acta Pharm Sin B. 2025 Mar;15(3):1344-1358. doi: 10.1016/j.apsb.2025.02.009. Epub 2025 Feb 12.

Cyto-Safe: A Machine Learning Tool for Early Identification of Cytotoxic Compounds in Drug Discovery.

J Chem Inf Model. 2024 Dec 23;64(24):9056-9062. doi: 10.1021/acs.jcim.4c01811. Epub 2024 Dec 11.

Sort & Slice: a simple and superior alternative to hash-based folding for extended-connectivity fingerprints.

J Cheminform. 2024 Dec 3;16(1):135. doi: 10.1186/s13321-024-00932-y.

Unlocking the potential of AI: Machine learning and deep learning models for predicting carcinogenicity of chemicals.

J Environ Sci Health C Toxicol Carcinog. 2025;43(1):23-50. doi: 10.1080/26896583.2024.2396731. Epub 2024 Sep 3.

Review of machine learning and deep learning models for toxicity prediction.

Exp Biol Med (Maywood). 2023 Nov;248(21):1952-1973. doi: 10.1177/15353702231209421. Epub 2023 Dec 6.

Automatic identification of chemical moieties.

Phys Chem Chem Phys. 2023 Oct 4;25(38):26370-26379. doi: 10.1039/d3cp03845a.

Artificial intelligence for natural product drug discovery.

Nat Rev Drug Discov. 2023 Nov;22(11):895-916. doi: 10.1038/s41573-023-00774-7. Epub 2023 Sep 11.

Evaluating the utility of a high throughput thiol-containing fluorescent probe to screen for reactivity: A case study with the Tox21 library.

Comput Toxicol. 2023 May;26. doi: 10.1016/j.comtox.2023.100271.

Discovery of 4-aminoindole carboxamide derivatives to curtail alpha-synuclein and tau isoform 2N4R oligomer formation.

Results Chem. 2023 Jan;5. doi: 10.1016/j.rechem.2023.100938. Epub 2023 Apr 28.
