一种用于药物诱导损伤系统生物学建模的集成学习方法。

An ensemble learning approach for modeling the systems biology of drug-induced injury.

机构信息

Research Programme on Biomedical Informatics (GRIB), Hospital del Mar Medical Research Institute (IMIM), DCEXS, Pompeu Fabra University (UPF), Barcelona, Spain.

Department of Toxicogenomics, Maastricht University, Maastricht, The Netherlands.

出版信息

Biol Direct. 2021 Jan 12;16(1):5. doi: 10.1186/s13062-020-00288-x.

DOI:10.1186/s13062-020-00288-x

PMID:33435983

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7805064/

Abstract

BACKGROUND

Drug-induced liver injury (DILI) is an adverse reaction caused by the intake of drugs of common use that produces liver damage. The impact of DILI is estimated to affect around 20 in 100,000 inhabitants worldwide each year. Despite being one of the main causes of liver failure, the pathophysiology and mechanisms of DILI are poorly understood. In the present study, we developed an ensemble learning approach based on different features (CMap gene expression, chemical structures, drug targets) to predict drugs that might cause DILI and gain a better understanding of the mechanisms linked to the adverse reaction.

RESULTS

We searched for gene signatures in CMap gene expression data by using two approaches: phenotype-gene associations data from DisGeNET, and a non-parametric test comparing gene expression of DILI-Concern and No-DILI-Concern drugs (as per DILIrank definitions). The average accuracy of the classifiers in both approaches was 69%. We used chemical structures as features, obtaining an accuracy of 65%. The combination of both types of features produced an accuracy around 63%, but improved the independent hold-out test up to 67%. The use of drug-target associations as feature obtained the best accuracy (70%) in the independent hold-out test.

CONCLUSIONS

When using CMap gene expression data, searching for a specific gene signature among the landmark genes improves the quality of the classifiers, but it is still limited by the intrinsic noise of the dataset. When using chemical structures as a feature, the structural diversity of the known DILI-causing drugs hampers the prediction, which is a similar problem as for the use of gene expression information. The combination of both features did not improve the quality of the classifiers but increased the robustness as shown on independent hold-out tests. The use of drug-target associations as feature improved the prediction, specially the specificity, and the results were comparable to previous research studies.

摘要

背景

药物性肝损伤（DILI）是由常用药物摄入引起的不良反应，导致肝脏损伤。据估计，DILI 的影响每年在全球每 10 万人中约有 20 人受到影响。尽管 DILI 是肝衰竭的主要原因之一，但 DILI 的病理生理学和机制仍知之甚少。在本研究中，我们开发了一种基于不同特征（CMap 基因表达、化学结构、药物靶点）的集成学习方法，以预测可能导致 DILI 的药物，并更好地了解与不良反应相关的机制。

结果

我们通过两种方法在 CMap 基因表达数据中搜索基因特征：DisGeNET 的表型-基因关联数据，以及非参数检验比较 DILI-Concern 和 No-DILI-Concern 药物的基因表达（根据 DILIrank 定义）。这两种方法的分类器平均准确率均为 69%。我们使用化学结构作为特征，获得了 65%的准确率。将这两种类型的特征结合起来，准确率约为 63%，但在独立验证测试中提高到了 67%。使用药物-靶点关联作为特征在独立验证测试中获得了最佳准确率（70%）。

结论

使用 CMap 基因表达数据时，在标志性基因中搜索特定的基因特征可以提高分类器的质量，但仍受到数据集固有噪声的限制。使用化学结构作为特征时，已知导致 DILI 的药物的结构多样性会阻碍预测，这与使用基因表达信息的问题类似。将这两种特征结合起来并没有提高分类器的质量，但增加了稳健性，这在独立验证测试中得到了证明。使用药物-靶点关联作为特征可以提高预测的特异性，特别是特异性，并且结果与以前的研究相似。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e9c/7805064/f41b6bd0d459/13062_2020_288_Fig1_HTML.jpg

相似文献

An ensemble learning approach for modeling the systems biology of drug-induced injury.一种用于药物诱导损伤系统生物学建模的集成学习方法。

Biol Direct. 2021 Jan 12;16(1):5. doi: 10.1186/s13062-020-00288-x.

Prediction and mechanistic analysis of drug-induced liver injury (DILI) based on chemical structure.基于化学结构预测和机制分析药物性肝损伤（DILI）。

Biol Direct. 2021 Jan 18;16(1):6. doi: 10.1186/s13062-020-00285-0.

Comparing Machine Learning Algorithms for Predicting Drug-Induced Liver Injury (DILI).比较用于预测药物性肝损伤（DILI）的机器学习算法。

Mol Pharm. 2020 Jul 6;17(7):2628-2637. doi: 10.1021/acs.molpharmaceut.0c00326. Epub 2020 Jun 8.

In Silico Prediction of Drug-Induced Liver Injury Based on Ensemble Classifier Method.基于集成分类器方法的药物性肝损伤的计算机预测。

Int J Mol Sci. 2019 Aug 22;20(17):4106. doi: 10.3390/ijms20174106.

Prediction of Drug-Induced Liver Injury: From Molecular Physicochemical Properties and Scaffold Architectures to Machine Learning Approaches.药物性肝损伤的预测：从分子理化性质和支架结构到机器学习方法。

Chem Biol Drug Des. 2024 Aug;104(2):e14607. doi: 10.1111/cbdd.14607.

Predicting Drug-Induced Liver Injury Using Machine Learning on a Diverse Set of Predictors.利用多种预测因素通过机器学习预测药物性肝损伤

Front Pharmacol. 2021 Aug 18;12:648805. doi: 10.3389/fphar.2021.648805. eCollection 2021.

Predictability of drug-induced liver injury by machine learning.基于机器学习预测药物性肝损伤。

Biol Direct. 2020 Feb 13;15(1):3. doi: 10.1186/s13062-020-0259-4.

Ensemble Models Based on QuBiLS-MAS Features and Shallow Learning for the Prediction of Drug-Induced Liver Toxicity: Improving Deep Learning and Traditional Approaches.基于 QuBiLS-MAS 特征和浅层学习的药物性肝毒性预测的集成模型：改进深度学习和传统方法。

Chem Res Toxicol. 2020 Jul 20;33(7):1855-1873. doi: 10.1021/acs.chemrestox.0c00030. Epub 2020 May 14.

Drug properties and host factors contribute to biochemical presentation of drug-induced liver injury: a prediction model from a machine learning approach.药物特性和宿主因素导致药物性肝损伤的生化表现：一种基于机器学习方法的预测模型。

Arch Toxicol. 2021 May;95(5):1793-1803. doi: 10.1007/s00204-021-03013-3. Epub 2021 Mar 5.

Machine-Learning Prediction of Oral Drug-Induced Liver Injury (DILI) via Multiple Features and Endpoints.基于多特征和终点的机器学习预测药物性肝损伤（DILI）。

Biomed Res Int. 2020 May 19;2020:4795140. doi: 10.1155/2020/4795140. eCollection 2020.

引用本文的文献

Machine Learning to Predict Drug-Induced Liver Injury and Its Validation on Failed Drug Candidates in Development.机器学习预测药物性肝损伤及其在研发中失败的候选药物上的验证

Toxics. 2024 May 24;12(6):385. doi: 10.3390/toxics12060385.

State of the Art of Genomic Technology in Toxicology: A Review.毒理学中基因组技术的最新进展：综述。

Int J Mol Sci. 2023 Jun 1;24(11):9618. doi: 10.3390/ijms24119618.

Determining human-coronavirus protein-protein interaction using machine intelligence.利用机器智能确定人类冠状病毒的蛋白质-蛋白质相互作用。

Med Nov Technol Devices. 2023 Jun;18:100228. doi: 10.1016/j.medntd.2023.100228. Epub 2023 Apr 6.

Artificial genetic polymers against human pathologies.人工遗传聚合物治疗人类疾病。

Biol Direct. 2022 Dec 6;17(1):39. doi: 10.1186/s13062-022-00353-7.

p53 regulates expression of nuclear envelope components in cancer cells.p53 调控癌细胞中核膜成分的表达。

Biol Direct. 2022 Dec 2;17(1):38. doi: 10.1186/s13062-022-00349-3.

TOXRIC: a comprehensive database of toxicological data and benchmarks.TOXRIC：一个全面的毒理学数据和基准数据库。

Nucleic Acids Res. 2023 Jan 6;51(D1):D1432-D1445. doi: 10.1093/nar/gkac1074.

Prediction of Adverse Drug Reaction Linked to Protein Targets Using Network-Based Information and Machine Learning.利用基于网络的信息和机器学习预测与蛋白质靶点相关的药物不良反应

Front Bioinform. 2022 Jul 14;2:906644. doi: 10.3389/fbinf.2022.906644. eCollection 2022.

An Algorithm Framework for Drug-Induced Liver Injury Prediction Based on Genetic Algorithm and Ensemble Learning.基于遗传算法和集成学习的药物性肝损伤预测算法框架。

Molecules. 2022 May 12;27(10):3112. doi: 10.3390/molecules27103112.

Predicting Drug-Induced Liver Injury Using Machine Learning on a Diverse Set of Predictors.利用多种预测因素通过机器学习预测药物性肝损伤

Front Pharmacol. 2021 Aug 18;12:648805. doi: 10.3389/fphar.2021.648805. eCollection 2021.

The eTRANSAFE Project on Translational Safety Assessment through Integrative Knowledge Management: Achievements and Perspectives.通过综合知识管理进行转化安全评估的eTRANSAFE项目：成就与展望

Pharmaceuticals (Basel). 2021 Mar 8;14(3):237. doi: 10.3390/ph14030237.

本文引用的文献

Extending the small-molecule similarity principle to all levels of biology with the Chemical Checker.用化学检验器将小分子相似性原理扩展到生物学的各个层次。

Nat Biotechnol. 2020 Sep;38(9):1087-1096. doi: 10.1038/s41587-020-0502-7. Epub 2020 May 18.

Predictability of drug-induced liver injury by machine learning.基于机器学习预测药物性肝损伤。

Biol Direct. 2020 Feb 13;15(1):3. doi: 10.1186/s13062-020-0259-4.

Diverse approaches to predicting drug-induced liver injury using gene-expression profiles.利用基因表达谱预测药物性肝损伤的多种方法。

Biol Direct. 2020 Jan 15;15(1):1. doi: 10.1186/s13062-019-0257-6.

Drug-induced liver injury severity and toxicity (DILIst): binary classification of 1279 drugs by human hepatotoxicity.药物性肝损伤严重程度和毒性（DILIst）：1279 种药物的人类肝毒性的二进制分类。

Drug Discov Today. 2020 Jan;25(1):201-208. doi: 10.1016/j.drudis.2019.09.022. Epub 2019 Nov 1.

Community assessment to advance computational prediction of cancer drug combinations in a pharmacogenomic screen.社区评估在药物基因组筛选中推进癌症药物组合的计算预测。

Nat Commun. 2019 Jun 17;10(1):2674. doi: 10.1038/s41467-019-09799-2.

Insights into mechanisms and severity of drug-induced liver injury via computational systems toxicology approach.通过计算系统毒理学方法洞察药物性肝损伤的机制和严重程度。

Toxicol Lett. 2019 Sep 15;312:22-33. doi: 10.1016/j.toxlet.2019.05.005. Epub 2019 May 4.

Applications of machine learning in drug discovery and development.机器学习在药物发现和开发中的应用。

Nat Rev Drug Discov. 2019 Jun;18(6):463-477. doi: 10.1038/s41573-019-0024-5.

GUILDify v2.0: A Tool to Identify Molecular Networks Underlying Human Diseases, Their Comorbidities and Their Druggable Targets.GUILDify v2.0：一种用于识别人类疾病、其共病和可药物治疗靶点的分子网络的工具。

J Mol Biol. 2019 Jun 14;431(13):2477-2484. doi: 10.1016/j.jmb.2019.02.027. Epub 2019 Mar 7.

Deep Neural Network Models for Predicting Chemically Induced Liver Toxicity Endpoints From Transcriptomic Responses.用于从转录组反应预测化学诱导肝毒性终点的深度神经网络模型

Front Pharmacol. 2019 Feb 5;10:42. doi: 10.3389/fphar.2019.00042. eCollection 2019.

UniProt: a worldwide hub of protein knowledge.UniProt：蛋白质知识的全球枢纽。

Nucleic Acids Res. 2019 Jan 8;47(D1):D506-D515. doi: 10.1093/nar/gky1049.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

一种用于药物诱导损伤系统生物学建模的集成学习方法。

An ensemble learning approach for modeling the systems biology of drug-induced injury.

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSIONS

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献