Explainable diabetes classification using hybrid Bayesian-optimized TabNet architecture.

Authors

Joseph Lionel P, Joseph Erica A, Prasad Ramendra

Affiliations

School of Mathematics, Physics, and Computing, University of Southern Queensland, Springfield, QLD, 4300, Australia.

Umanand Prasad School of Medicine and Health Sciences, The University of Fiji, Saweni, Lautoka, Fiji.

Publication information

Comput Biol Med. 2022 Dec;151(Pt A):106178. doi: 10.1016/j.compbiomed.2022.106178. Epub 2022 Oct 6.

DOI:10.1016/j.compbiomed.2022.106178

PMID:36306578

Abstract

Diabetes is a deadly chronic disease that occurs when the pancreas cannot produce enough insulin or when the body cannot use insulin effectively. If undetected, it may lead to a host of health complications. Hence, accurate and explainable early-stage detection of diabetes is essential for administering appropriate treatment and enabling patients to lead healthy and productive lives. To this end, we developed an interpretable TabNet model tuned via Bayesian optimization (BO). To achieve model-specific interpretability, the attention mechanism of the TabNet architecture was used, which offered local and global explanations of the influence of the attributes on the outcomes. The model was further explained locally and globally using the more robust model-agnostic eXplainable Artificial Intelligence (XAI) tools LIME and SHAP. The proposed model outperformed all benchmarked models, achieving accuracies of 92.2% and 99.4% on the Pima Indians diabetes dataset (PIDD) and the early-stage diabetes risk prediction dataset (ESDRPD), respectively. Based on the XAI results, the most influential attributes for diabetes classification on PIDD and ESDRPD were Insulin and Polyuria, respectively, with feature importance values of 0.301 (Insulin, PIDD) and 0.206 (Polyuria, ESDRPD). The high accuracy and ancillary interpretability of the proposed model are expected to increase end users' trust and confidence in early-stage detection of diabetes.
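The abstract distinguishes local (per-patient) from global (dataset-wide) explanations. One common way to connect the two is to sum the per-sample attributions for each feature and normalize the totals so they sum to 1. The sketch below illustrates only that aggregation step. It is not the authors' code: the function name `global_importance`, the three feature names (taken from PIDD column names), and all numeric values are hypothetical, chosen purely for illustration.

```python
from typing import Dict, List


def global_importance(
    local_attributions: List[List[float]], feature_names: List[str]
) -> Dict[str, float]:
    """Aggregate per-sample attributions into normalized global importances.

    Each row holds one sample's attribution per feature. Magnitudes are
    summed per feature and divided by the grand total, so the returned
    importances are non-negative and sum to 1.
    """
    if not local_attributions:
        raise ValueError("at least one sample is required")

    totals = [0.0] * len(feature_names)
    for row in local_attributions:
        if len(row) != len(feature_names):
            raise ValueError("each row needs one value per feature")
        for j, value in enumerate(row):
            totals[j] += abs(value)  # use magnitude so signed scores also work

    grand_total = sum(totals)
    if grand_total == 0.0:
        raise ValueError("all attributions are zero; cannot normalize")

    return {name: total / grand_total for name, total in zip(feature_names, totals)}


# Hypothetical toy attributions for three samples (NOT results from the paper).
features = ["Glucose", "BMI", "Insulin"]
local_scores = [
    [0.2, 0.1, 0.7],
    [0.5, 0.3, 0.2],
    [0.1, 0.2, 0.7],
]

importance = global_importance(local_scores, features)
top_feature = max(importance, key=importance.get)
print(importance)
print("Most influential feature in this toy example:", top_feature)
```

Under this convention, a reported global importance such as 0.301 reads as a share of the total attribution mass, which is why values for different features can be compared directly.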

Similar articles

Enhanced joint hybrid deep neural network explainable artificial intelligence model for 1-hr ahead solar ultraviolet index prediction.

Comput Methods Programs Biomed. 2023 Nov;241:107737. doi: 10.1016/j.cmpb.2023.107737. Epub 2023 Aug 5.

Utilization of model-agnostic explainable artificial intelligence frameworks in oncology: a narrative review.

Transl Cancer Res. 2022 Oct;11(10):3853-3868. doi: 10.21037/tcr-22-1626.

Model-agnostic explainable artificial intelligence tools for severity prediction and symptom analysis on Indian COVID-19 data.

Front Artif Intell. 2023 Dec 4;6:1272506. doi: 10.3389/frai.2023.1272506. eCollection 2023.

HGSORF: Henry Gas Solubility Optimization-based Random Forest for C-Section prediction and XAI-based cause analysis.

Comput Biol Med. 2022 Aug;147:105671. doi: 10.1016/j.compbiomed.2022.105671. Epub 2022 May 30.

IHCP: interpretable hepatitis C prediction system based on black-box machine learning models.

BMC Bioinformatics. 2023 Sep 6;24(1):333. doi: 10.1186/s12859-023-05456-0.

DeepXplainer: An interpretable deep learning based approach for lung cancer detection using explainable artificial intelligence.

Comput Methods Programs Biomed. 2024 Jan;243:107879. doi: 10.1016/j.cmpb.2023.107879. Epub 2023 Oct 24.

Explainable artificial intelligence model for identifying COVID-19 gene biomarkers.

Comput Biol Med. 2023 Mar;154:106619. doi: 10.1016/j.compbiomed.2023.106619. Epub 2023 Feb 1.

Explainable artificial intelligence in breast cancer detection and risk prediction: A systematic scoping review.

Cancer Innov. 2024 Jul 3;3(5):e136. doi: 10.1002/cai2.136. eCollection 2024 Oct.

Multimodal brain tumor segmentation and classification from MRI scans based on optimized DeepLabV3+ and interpreted networks information fusion empowered with explainable AI.

Comput Biol Med. 2024 Nov;182:109183. doi: 10.1016/j.compbiomed.2024.109183. Epub 2024 Oct 2.

Cited by

Machine learning prediction and interpretability analysis of high-risk chest pain: a study from the MIMIC-IV database.

Front Physiol. 2025 Jun 30;16:1594277. doi: 10.3389/fphys.2025.1594277. eCollection 2025.

An interpretable deep learning framework using FCT-SMOTE and BO-TabNet algorithms for reservoir water sensitivity damage prediction.

Sci Rep. 2025 May 28;15(1):18655. doi: 10.1038/s41598-025-99659-5.

Prediction of cancer cell line-specific synergistic drug combinations based on multi-omics data.

PeerJ. 2025 Feb 25;13:e19078. doi: 10.7717/peerj.19078. eCollection 2025.

Example dependent cost sensitive learning based selective deep ensemble model for customer credit scoring.

Sci Rep. 2025 Feb 18;15(1):6000. doi: 10.1038/s41598-025-89880-7.

An Ensemble Learning Approach Based on TabNet and Machine Learning Models for Cheating Detection in Educational Tests.

Educ Psychol Meas. 2024 Aug;84(4):780-809. doi: 10.1177/00131644231191298. Epub 2023 Aug 21.

Machine learning analysis of thermophysical and thermohydraulic properties in ethylene glycol- and glycerol-based SiO2 nanofluids.

Sci Rep. 2024 Jun 27;14(1):14829. doi: 10.1038/s41598-024-65411-8.

Personalized venlafaxine dose prediction using artificial intelligence technology: a retrospective analysis based on real-world data.

Int J Clin Pharm. 2024 Aug;46(4):926-936. doi: 10.1007/s11096-024-01729-7. Epub 2024 May 11.

Weighted Bayesian Belief Network for diabetics: a predictive model.

Front Artif Intell. 2024 Apr 11;7:1357121. doi: 10.3389/frai.2024.1357121. eCollection 2024.

Enhanced osteoporotic fracture prediction in postmenopausal women using Bayesian optimization of machine learning models with genetic risk score.

J Bone Miner Res. 2024 May 2;39(4):462-472. doi: 10.1093/jbmr/zjae025.

Explainable Artificial Intelligence Paves the Way in Precision Diagnostics and Biomarker Discovery for the Subclass of Diabetic Retinopathy in Type 2 Diabetics.

Metabolites. 2023 Dec 18;13(12):1204. doi: 10.3390/metabo13121204.
