• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

综合肝毒性预测:整合机器学习与深度学习的集成模型

Comprehensive hepatotoxicity prediction: ensemble model integrating machine learning and deep learning.

作者信息

Khan Muhammad Zafar Irshad, Ren Jia-Nan, Cao Cheng, Ye Hong-Yu-Xiang, Wang Hao, Guo Ya-Min, Yang Jin-Rong, Chen Jian-Zhong

机构信息

College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, China.

Polytechnic Institute, Zhejiang University, Hangzhou, China.

出版信息

Front Pharmacol. 2024 Aug 21;15:1441587. doi: 10.3389/fphar.2024.1441587. eCollection 2024.

DOI:10.3389/fphar.2024.1441587
PMID:39234116
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11373136/
Abstract

BACKGROUND

Chemicals may lead to acute liver injuries, posing a serious threat to human health. Achieving the precise safety profile of a compound is challenging due to the complex and expensive testing procedures. In silico approaches will aid in identifying the potential risk of drug candidates in the initial stage of drug development and thus mitigating the developmental cost.

METHODS

In current studies, QSAR models were developed for hepatotoxicity predictions using the ensemble strategy to integrate machine learning (ML) and deep learning (DL) algorithms using various molecular features. A large dataset of 2588 chemicals and drugs was randomly divided into training (80%) and test (20%) sets, followed by the training of individual base models using diverse machine learning or deep learning based on three different kinds of descriptors and fingerprints. Feature selection approaches were employed to proceed with model optimizations based on the model performance. Hybrid ensemble approaches were further utilized to determine the method with the best performance.

RESULTS

The voting ensemble classifier emerged as the optimal model, achieving an excellent prediction accuracy of 80.26%, AUC of 82.84%, and recall of over 93% followed by bagging and stacking ensemble classifiers method. The model was further verified by an external test set, internal 10-fold cross-validation, and rigorous benchmark training, exhibiting much better reliability than the published models.

CONCLUSION

The proposed ensemble model offers a dependable assessment with a good performance for the prediction regarding the risk of chemicals and drugs to induce liver damage.

摘要

背景

化学物质可能导致急性肝损伤,对人类健康构成严重威胁。由于测试程序复杂且昂贵,要获得化合物准确的安全性概况具有挑战性。计算机模拟方法将有助于在药物开发的初始阶段识别候选药物的潜在风险,从而降低开发成本。

方法

在当前研究中,使用集成策略开发了用于肝毒性预测的QSAR模型,该策略使用各种分子特征来整合机器学习(ML)和深度学习(DL)算法。将包含2588种化学物质和药物的大型数据集随机分为训练集(80%)和测试集(20%),然后使用基于三种不同描述符和指纹的各种机器学习或深度学习方法训练单个基础模型。基于模型性能采用特征选择方法进行模型优化。进一步利用混合集成方法来确定性能最佳的方法。

结果

投票集成分类器成为最优模型,预测准确率达到80.26%,AUC为82.84%,召回率超过93%,其次是装袋和堆叠集成分类器方法。该模型通过外部测试集、内部10折交叉验证和严格的基准训练进一步验证,表现出比已发表模型更高的可靠性。

结论

所提出的集成模型为预测化学物质和药物诱导肝损伤的风险提供了可靠的评估,性能良好。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8033/11373136/f73c9f5883bd/fphar-15-1441587-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8033/11373136/e3e678a2f98d/fphar-15-1441587-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8033/11373136/542c7ccb0bab/fphar-15-1441587-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8033/11373136/26441c992777/fphar-15-1441587-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8033/11373136/81287f5a264d/fphar-15-1441587-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8033/11373136/fbe9247401d8/fphar-15-1441587-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8033/11373136/f73c9f5883bd/fphar-15-1441587-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8033/11373136/e3e678a2f98d/fphar-15-1441587-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8033/11373136/542c7ccb0bab/fphar-15-1441587-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8033/11373136/26441c992777/fphar-15-1441587-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8033/11373136/81287f5a264d/fphar-15-1441587-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8033/11373136/fbe9247401d8/fphar-15-1441587-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8033/11373136/f73c9f5883bd/fphar-15-1441587-g006.jpg

相似文献

1
Comprehensive hepatotoxicity prediction: ensemble model integrating machine learning and deep learning.综合肝毒性预测:整合机器学习与深度学习的集成模型
Front Pharmacol. 2024 Aug 21;15:1441587. doi: 10.3389/fphar.2024.1441587. eCollection 2024.
2
Prediction of hERG potassium channel blockage using ensemble learning methods and molecular fingerprints.使用集成学习方法和分子指纹预测人醚 - 去极化激活的钾离子通道阻滞
Toxicol Lett. 2020 Oct 10;332:88-96. doi: 10.1016/j.toxlet.2020.07.003. Epub 2020 Jul 3.
3
Ensemble Models Based on QuBiLS-MAS Features and Shallow Learning for the Prediction of Drug-Induced Liver Toxicity: Improving Deep Learning and Traditional Approaches.基于 QuBiLS-MAS 特征和浅层学习的药物性肝毒性预测的集成模型:改进深度学习和传统方法。
Chem Res Toxicol. 2020 Jul 20;33(7):1855-1873. doi: 10.1021/acs.chemrestox.0c00030. Epub 2020 May 14.
4
Prediction of the Blood-Brain Barrier (BBB) Permeability of Chemicals Based on Machine-Learning and Ensemble Methods.基于机器学习和集成方法的化学物质血脑屏障(BBB)渗透性预测。
Chem Res Toxicol. 2021 Jun 21;34(6):1456-1467. doi: 10.1021/acs.chemrestox.0c00343. Epub 2021 May 28.
5
Ensemble Learning for Disease Prediction: A Review.用于疾病预测的集成学习:综述
Healthcare (Basel). 2023 Jun 20;11(12):1808. doi: 10.3390/healthcare11121808.
6
A new hybrid ensemble machine-learning model for severity risk assessment and post-COVID prediction system.一种新的混合集成机器学习模型,用于严重程度风险评估和 COVID 后预测系统。
Math Biosci Eng. 2022 Apr 13;19(6):6102-6123. doi: 10.3934/mbe.2022285.
7
Improving Individual Brain Age Prediction Using an Ensemble Deep Learning Framework.使用集成深度学习框架改进个体脑龄预测
Front Psychiatry. 2021 Mar 23;12:626677. doi: 10.3389/fpsyt.2021.626677. eCollection 2021.
8
Comparative analysis of supervised learning algorithms for prediction of cardiovascular diseases.监督学习算法在心血管疾病预测中的比较分析。
Technol Health Care. 2024;32(S1):241-251. doi: 10.3233/THC-248021.
9
A deep learning-based multi-model ensemble method for cancer prediction.基于深度学习的癌症预测多模型集成方法。
Comput Methods Programs Biomed. 2018 Jan;153:1-9. doi: 10.1016/j.cmpb.2017.09.005. Epub 2017 Sep 14.
10
Exploiting ensemble learning to improve prediction of phospholipidosis inducing potential.利用集成学习提高磷脂蓄积诱导潜力预测。
J Theor Biol. 2019 Oct 21;479:37-47. doi: 10.1016/j.jtbi.2019.07.009. Epub 2019 Jul 13.

引用本文的文献

1
Development and validation of an explainable machine learning model for predicting postoperative pulmonary complications after lung cancer surgery: a machine learning study.用于预测肺癌手术后肺部并发症的可解释机器学习模型的开发与验证:一项机器学习研究
EClinicalMedicine. 2025 Aug 1;86:103386. doi: 10.1016/j.eclinm.2025.103386. eCollection 2025 Aug.
2
Impact of halogenation on scaffold toxicity assessed using HD-GEM machine learning model.使用HD-GEM机器学习模型评估卤化对支架毒性的影响。
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf347.
3
Leveraging machine learning models in evaluating ADMET properties for drug discovery and development.

本文引用的文献

1
Deep Learning Algorithm Based on Molecular Fingerprint for Prediction of Drug-Induced Liver Injury.基于分子指纹的深度学习算法预测药物性肝损伤
Toxicology. 2024 Feb;502:153736. doi: 10.1016/j.tox.2024.153736. Epub 2024 Feb 1.
2
The last resort requirement under REACH: From principle to practice.REACH 法规下的最后手段要求:从原则到实践。
Regul Toxicol Pharmacol. 2024 Feb;147:105557. doi: 10.1016/j.yrtph.2023.105557. Epub 2023 Dec 23.
3
Predicting anti-SARS-CoV-2 activities of chemical compounds using machine learning models.
利用机器学习模型评估药物发现与开发中的ADMET性质。
ADMET DMPK. 2025 Jun 7;13(3):2772. doi: 10.5599/admet.2772. eCollection 2025.
使用机器学习模型预测化合物的抗SARS-CoV-2活性。
Artif Intell Chem. 2023 Dec;1(2). doi: 10.1016/j.aichem.2023.100029. Epub 2023 Nov 19.
4
Machine Learning-Based Prediction of Drug-Induced Hepatotoxicity: An OvA-QSTR Approach.基于机器学习的药物性肝毒性预测:一种 OvA-QSTR 方法。
J Chem Inf Model. 2023 Aug 14;63(15):4602-4614. doi: 10.1021/acs.jcim.3c00687. Epub 2023 Jul 26.
5
A Study on Different Deep Learning Algorithms Used in Deep Neural Nets: MLP SOM and DBN.深度神经网络中使用的不同深度学习算法研究:多层感知器、自组织映射和深度信念网络。
Wirel Pers Commun. 2023;128(4):2913-2936. doi: 10.1007/s11277-022-10079-4. Epub 2022 Oct 19.
6
Application of ToxCast/Tox21 data for toxicity mechanism-based evaluation and prioritization of environmental chemicals: Perspective and limitations.利用 ToxCast/Tox21 数据进行基于毒性机制的评估和优先排序环境化学物质:观点和局限性。
Toxicol In Vitro. 2022 Oct;84:105451. doi: 10.1016/j.tiv.2022.105451. Epub 2022 Jul 31.
7
Reliable CA-(Q)SAR generation based on entropy weight optimized by grid search and correction factors.基于网格搜索和校正因子优化的熵权可靠 CA-(Q)SAR 生成。
Comput Biol Med. 2022 Jul;146:105573. doi: 10.1016/j.compbiomed.2022.105573. Epub 2022 Apr 30.
8
Prediction reliability of QSAR models: an overview of various validation tools.QSAR 模型的预测可靠性:各种验证工具概述。
Arch Toxicol. 2022 May;96(5):1279-1295. doi: 10.1007/s00204-022-03252-y. Epub 2022 Mar 10.
9
Complex features extraction with deep learning model for the detection of COVID19 from CT scan images using ensemble based machine learning approach.基于集成机器学习方法,利用深度学习模型从CT扫描图像中提取复杂特征以检测新型冠状病毒肺炎。
Expert Syst Appl. 2022 Jun 1;195:116554. doi: 10.1016/j.eswa.2022.116554. Epub 2022 Feb 4.
10
Predictive Model for Drug-Induced Liver Injury Using Deep Neural Networks Based on Substructure Space.基于子结构空间的深度神经网络用于药物性肝损伤的预测模型。
Molecules. 2021 Dec 13;26(24):7548. doi: 10.3390/molecules26247548.