• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用机器学习模型预测化学物质呼吸毒性的Optuna优化。

OPTUNA optimization for predicting chemical respiratory toxicity using ML models.

作者信息

Shehab Eman, Nayel Hamada, Taha Mohamed

机构信息

Department of Computer Science, Faculty of Computers and Artificial Intelligence, University of Sadat City, Sadat, Egypt.

Department of Computer Science, Faculty of Computers and Artificial Intelligence, Benha University, Benha, Egypt.

出版信息

J Comput Aided Mol Des. 2025 Apr 26;39(1):21. doi: 10.1007/s10822-025-00597-1.

DOI:10.1007/s10822-025-00597-1
PMID:40285895
Abstract

Predicting molecular toxicity is an important stage in the process of drug discovery. It is directly related to medical destiny and human health. This paper presents an enhanced model for chemical respiratory toxicity prediction. It used a combination of molecular descriptors and term frequency - inverse document frequency (TF-IDF) based models with different machine learning algorithms. To address class imbalance, SMOTE is applied. Appropriate hyper-parameter tuning is required to generate a better system with a classifier. So, we adjusted the hyper-parameters of various models and used the adjusted parameters to train the model. We tuned hyper-parameters using OPTUNA. Internal and external validation were used to confirm the models' performance. According to the results, the model's internal validation accuracy and AUC using the random forest approach were 88.6% and 93.2%. For external validation, the model's accuracy value using random forest and Gradient Boosting Classifier were 92.2% with AUC 97%. Comparing these results with previous studies shows that our model performs better compared to them.

摘要

预测分子毒性是药物发现过程中的一个重要阶段。它直接关系到医学命运和人类健康。本文提出了一种用于化学物质呼吸毒性预测的增强模型。该模型结合了分子描述符和基于词频 - 逆文档频率(TF-IDF)的模型,并采用了不同的机器学习算法。为了解决类别不平衡问题,应用了SMOTE算法。需要进行适当的超参数调整,以使用分类器生成更好的系统。因此,我们调整了各种模型的超参数,并使用调整后的参数训练模型。我们使用OPTUNA调整超参数。通过内部和外部验证来确认模型的性能。根据结果,使用随机森林方法时模型的内部验证准确率和AUC分别为88.6%和93.2%。对于外部验证,使用随机森林和梯度提升分类器时模型的准确率值为92.2%,AUC为97%。将这些结果与先前的研究进行比较表明,我们的模型比它们表现更好。

相似文献

1
OPTUNA optimization for predicting chemical respiratory toxicity using ML models.使用机器学习模型预测化学物质呼吸毒性的Optuna优化。
J Comput Aided Mol Des. 2025 Apr 26;39(1):21. doi: 10.1007/s10822-025-00597-1.
2
Optimization of diabetes prediction methods based on combinatorial balancing algorithm.基于组合平衡算法的糖尿病预测方法优化。
Nutr Diabetes. 2024 Aug 14;14(1):63. doi: 10.1038/s41387-024-00324-z.
3
Explore the factors related to the death of offspring under age five and appraise the hazard of child mortality using machine learning techniques in Bangladesh.在孟加拉国,利用机器学习技术探究与五岁以下儿童死亡相关的因素,并评估儿童死亡风险。
BMC Public Health. 2025 Jan 29;25(1):360. doi: 10.1186/s12889-025-21460-w.
4
Development and validation of a machine learning-based predictive model for assessing the 90-day prognostic outcome of patients with spontaneous intracerebral hemorrhage.基于机器学习的预测模型评估自发性脑出血患者 90 天预后结局的开发与验证。
J Transl Med. 2024 Mar 4;22(1):236. doi: 10.1186/s12967-024-04896-3.
5
Tree-Based Machine Learning Models with Optuna in Predicting Impedance Values for Circuit Analysis.基于树的机器学习模型与Optuna用于预测电路分析的阻抗值。
Micromachines (Basel). 2023 Jan 20;14(2):265. doi: 10.3390/mi14020265.
6
Refining heart disease prediction accuracy using hybrid machine learning techniques with novel metaheuristic algorithms.利用具有新颖元启发式算法的混合机器学习技术提高心脏病预测准确性。
Int J Cardiol. 2024 Dec 1;416:132506. doi: 10.1016/j.ijcard.2024.132506. Epub 2024 Aug 30.
7
Prediction and feature selection of low birth weight using machine learning algorithms.利用机器学习算法预测和选择低出生体重。
J Health Popul Nutr. 2024 Oct 12;43(1):157. doi: 10.1186/s41043-024-00647-8.
8
Development and validation of a prediction model for coronary heart disease risk in depressed patients aged 20 years and older using machine learning algorithms.使用机器学习算法开发并验证针对20岁及以上抑郁症患者冠心病风险的预测模型。
Front Cardiovasc Med. 2025 Jan 9;11:1504957. doi: 10.3389/fcvm.2024.1504957. eCollection 2024.
9
ADMET Evaluation in Drug Discovery. 18. Reliable Prediction of Chemical-Induced Urinary Tract Toxicity by Boosting Machine Learning Approaches.药物发现中的 ADMET 评估。18. 通过机器学习方法的提升实现可靠的化学诱导的泌尿道毒性预测。
Mol Pharm. 2017 Nov 6;14(11):3935-3953. doi: 10.1021/acs.molpharmaceut.7b00631. Epub 2017 Oct 27.
10
Hospital mortality prediction in traumatic injuries patients: comparing different SMOTE-based machine learning algorithms.创伤性损伤患者的医院死亡率预测:比较不同基于 SMOTE 的机器学习算法。
BMC Med Res Methodol. 2023 Apr 22;23(1):101. doi: 10.1186/s12874-023-01920-w.

本文引用的文献

1
TISBE: A Public Web Platform for the Consensus-Based Explainable Prediction of Developmental Toxicity.基于共识的发育毒性可解释性预测公共网络平台 TISBE
Chem Res Toxicol. 2024 Feb 19;37(2):323-339. doi: 10.1021/acs.chemrestox.3c00310. Epub 2024 Jan 10.
2
Integrating QSAR modelling and deep learning in drug discovery: the emergence of deep QSAR.将 QSAR 建模与深度学习整合到药物发现中:深 QSAR 的出现。
Nat Rev Drug Discov. 2024 Feb;23(2):141-155. doi: 10.1038/s41573-023-00832-0. Epub 2023 Dec 8.
3
Machine learning-based approach for efficient prediction of toxicity of chemical gases using feature selection.
基于机器学习的方法,使用特征选择对化学气体毒性进行高效预测。
J Hazard Mater. 2023 Aug 5;455:131616. doi: 10.1016/j.jhazmat.2023.131616. Epub 2023 May 10.
4
A review on longitudinal data analysis with random forest.随机森林的纵向数据分析综述。
Brief Bioinform. 2023 Mar 19;24(2). doi: 10.1093/bib/bbad002.
5
An Explainable Supervised Machine Learning Model for Predicting Respiratory Toxicity of Chemicals Using Optimal Molecular Descriptors.一种使用最优分子描述符预测化学物质呼吸毒性的可解释监督机器学习模型。
Pharmaceutics. 2022 Apr 11;14(4):832. doi: 10.3390/pharmaceutics14040832.
6
Drug repositioning: Progress and challenges in drug discovery for various diseases.药物重定位:各种疾病药物发现中的进展和挑战。
Eur J Med Chem. 2022 Apr 15;234:114239. doi: 10.1016/j.ejmech.2022.114239. Epub 2022 Feb 28.
7
In silico methods and tools for drug discovery.基于计算机的药物研发方法和工具。
Comput Biol Med. 2021 Oct;137:104851. doi: 10.1016/j.compbiomed.2021.104851. Epub 2021 Sep 8.
8
In silico prediction of chemical reproductive toxicity using machine learning.利用机器学习进行化学生殖毒性的计算预测。
J Appl Toxicol. 2019 Jun;39(6):844-854. doi: 10.1002/jat.3772. Epub 2019 Jan 27.
9
Development and evaluation of in silico prediction model for drug-induced respiratory toxicity by using naïve Bayes classifier method.基于朴素贝叶斯分类器方法建立药物性呼吸毒性的计算机预测模型:开发与评价
Food Chem Toxicol. 2018 Nov;121:593-603. doi: 10.1016/j.fct.2018.09.051. Epub 2018 Sep 25.
10
QSAR modeling of cumulative environmental end-points for the prioritization of hazardous chemicals.用于优先考虑危险化学品的累积环境终点的定量构效关系建模。
Environ Sci Process Impacts. 2018 Jan 24;20(1):38-47. doi: 10.1039/c7em00519a.