Leveraging heterogeneous data from GHS toxicity annotations, molecular and protein target descriptors and Tox21 assay readouts to predict and rationalise acute toxicity.

Author information

Allen Chad H G, Mervin Lewis H, Mahmoud Samar Y, Bender Andreas

Affiliation

Department of Chemistry, Centre for Molecular Informatics, Lensfield Road, Cambridge, CB2 1EW, UK.

Publication information

J Cheminform. 2019 May 31;11(1):36. doi: 10.1186/s13321-019-0356-5.

DOI:10.1186/s13321-019-0356-5

PMID:31152262

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC6544914/

Abstract

Despite increasing knowledge in both the chemical and biological domains, the assimilation and exploration of heterogeneous datasets encoding information about the chemical, bioactivity and phenotypic properties of compounds remains a challenge, owing to the requirement for overlap between the chemicals assayed across these spaces. Here, we have constructed a novel dataset, larger than we have used in prior work, comprising 579 acute oral toxic compounds and 1427 non-toxic compounds derived from regulatory GHS information, along with their corresponding molecular and protein target descriptors and qHTS in vitro assay readouts from the Tox21 project. We found no clear association between the results of a FAFDrugs4 toxicophore screen and the acute oral toxicity classifications for our compound set. A screen using a subset of the ToxAlerts toxicophores was also of limited utility, with only slight enrichment toward the toxic set (odds ratio of 1.48). We then investigated to what degree toxic and non-toxic compounds could be separated in each of the spaces, to compare their potential contribution to further analyses. Using an LDA projection, we found the largest separation between toxicity classes using chemical descriptors (Cohen's d of 1.95) and the lowest using qHTS descriptors (Cohen's d of 0.67). To compare the predictivity of the feature spaces for the toxicity endpoint, we next trained Random Forest (RF) acute oral toxicity classifiers on either molecular, protein target or qHTS descriptors. RFs trained on molecular and protein target descriptors were most predictive, with ROC AUC values of 0.80-0.92 and 0.70-0.85, respectively, across three test sets. RFs trained on chemical and protein target descriptors combined exhibited similar predictive performance to the single-domain models (ROC AUC of 0.80-0.91). Model interpretability was improved by the inclusion of protein target descriptors, which allow the identification of specific targets (e.g. Retinal dehydrogenase) with literature links to toxic modes of action (e.g. oxidative stress). The dataset compiled in this study has been made available for future application.
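The abstract reports three summary statistics: an odds ratio for the toxicophore screen, Cohen's d for class separation along the LDA projection, and ROC AUC for the classifiers. The block below is a minimal sketch of the conventional textbook definitions of these quantities, not the authors' code. The function names and all toy numbers are hypothetical, and the Cohen's d helper assumes that one-dimensional projection scores per class are already available.

```python
from math import sqrt
from statistics import mean, variance


def odds_ratio(alert_toxic, no_alert_toxic, alert_nontoxic, no_alert_nontoxic):
    """Odds of carrying an alert among toxic compounds divided by
    the odds of carrying an alert among non-toxic compounds."""
    return (alert_toxic / no_alert_toxic) / (alert_nontoxic / no_alert_nontoxic)


def cohens_d(group_a, group_b):
    """Cohen's d: difference in means over the pooled sample standard deviation."""
    n_a, n_b = len(group_a), len(group_b)
    pooled_var = (
        (n_a - 1) * variance(group_a) + (n_b - 1) * variance(group_b)
    ) / (n_a + n_b - 2)
    return (mean(group_a) - mean(group_b)) / sqrt(pooled_var)


def roc_auc(scores_pos, scores_neg):
    """ROC AUC as the probability that a randomly chosen positive scores
    higher than a randomly chosen negative (ties count as one half)."""
    wins = 0.0
    for p in scores_pos:
        for n in scores_neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(scores_pos) * len(scores_neg))


# Toy inputs (hypothetical, NOT taken from the paper).
toy_or = odds_ratio(30, 70, 20, 80)                     # 2x2 alert counts
toy_d = cohens_d([2.0, 3.0, 4.0], [0.0, 1.0, 2.0])      # 1-D projection scores
toy_auc = roc_auc([0.9, 0.8, 0.4], [0.5, 0.3, 0.1])     # classifier scores

print(f"odds ratio = {toy_or:.3f}")
print(f"Cohen's d  = {toy_d:.3f}")
print(f"ROC AUC    = {toy_auc:.3f}")
```

An odds ratio near 1 indicates little enrichment of alerts in the toxic set, which is how the reported value of 1.48 is read in the abstract. The pairwise AUC loop is quadratic in the number of compounds; it is written for clarity rather than speed.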

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c563/6544914/c1ba696e22d6/13321_2019_356_Fig1_HTML.jpg

Similar articles

Use of in vitro HTS-derived concentration-response data as biological descriptors improves the accuracy of QSAR models of in vivo toxicity.

Environ Health Perspect. 2011 Mar;119(3):364-70. doi: 10.1289/ehp.1002476. Epub 2010 Oct 27.

Prediction of chemical-induced acute toxicity using in vitro assay data and chemical structure.

Toxicol Appl Pharmacol. 2024 Nov;492:117098. doi: 10.1016/j.taap.2024.117098. Epub 2024 Sep 7.

Improving the prediction of organism-level toxicity through integration of chemical, protein target and cytotoxicity qHTS data.

Toxicol Res (Camb). 2016 Mar 3;5(3):883-894. doi: 10.1039/c5tx00406c. eCollection 2016 May 1.

Predicting hepatotoxicity using ToxCast in vitro bioactivity and chemical structure.

Chem Res Toxicol. 2015 Apr 20;28(4):738-51. doi: 10.1021/tx500501h. Epub 2015 Mar 9.

Predictive Models for Human Organ Toxicity Based on Bioactivity Data and Chemical Structure.

Chem Res Toxicol. 2020 Mar 16;33(3):731-741. doi: 10.1021/acs.chemrestox.9b00305. Epub 2020 Mar 3.

Toxicity prediction using target, interactome, and pathway profiles as descriptors.

Toxicol Lett. 2023 May 15;381:20-26. doi: 10.1016/j.toxlet.2023.04.005. Epub 2023 Apr 13.

ChemBioSim: Enhancing Conformal Prediction of In Vivo Toxicity by Use of Predicted Bioactivities.

J Chem Inf Model. 2021 Jul 26;61(7):3255-3272. doi: 10.1021/acs.jcim.1c00451. Epub 2021 Jun 21.

Discriminating toxicant classes by mode of action. 1. (Eco)toxicity profiles.

Environ Sci Pollut Res Int. 2006 May;13(3):192-203. doi: 10.1065/espr2006.01.013.

Tox21 Enricher: Web-based Chemical/Biological Functional Annotation Analysis Tool Based on Tox21 Toxicity Screening Platform.

Mol Inform. 2018 May;37(5):e1700129. doi: 10.1002/minf.201700129. Epub 2018 Jan 29.

Cited by

ProfhEX: AI-based platform for small molecules liability profiling.

J Cheminform. 2023 Jun 9;15(1):60. doi: 10.1186/s13321-023-00728-6.

Insights into the molecular properties underlying antibacterial activity of prenylated (iso)flavonoids against MRSA.

Sci Rep. 2021 Jul 9;11(1):14180. doi: 10.1038/s41598-021-92964-9.

A cross-industry collaboration to assess if acute oral toxicity (Q)SAR models are fit-for-purpose for GHS classification and labelling.

Regul Toxicol Pharmacol. 2021 Mar;120:104843. doi: 10.1016/j.yrtph.2020.104843. Epub 2020 Dec 17.
