Shaheen Ifra, Javaid Nadeem, Alrajeh Nabil, Asim Yousra, Akber Syed Muhammad Abrar
ComSens Lab, International Graduate School of Artificial Intelligence, National Yunlin University of Science and Technology, Douliou, Yunlin, 64002, Taiwan.
Department of Biomedical Technology, College of Applied Medical Sciences, King Saud University, Riyadh, 11633, Saudi Arabia.
Med Biol Eng Comput. 2025 Mar 4. doi: 10.1007/s11517-025-03338-6.
Diabetes is a metabolic condition that can lead to chronic illness and organ failure if it remains untreated. Accurate detection is essential to reduce these risks at an early stage. Recent advancements in predictive models show promising results. However, these models exhibit inadequate accuracy, struggle with class imbalance, and lack interpretability of the decision-making process. To overcome these issues, we propose two novel deep models for early and accurate diabetes prediction: LeDNet (inspired by LeNet and the Dual Attention Network) and HiDenNet (influenced by the highway network and DenseNet). The models are trained using the Diabetes Health Indicators dataset, which has an inherent class imbalance problem and results in biased predictions. This imbalance is mitigated by employing the majority-weighted minority over-sampling technique. Experimental findings demonstrate that LeDNet achieves an F1-score of 85%, recall of 84%, accuracy of 85%, and precision of 86%. Similarly, HiDenNet achieves accuracy, F1-score, recall, and precision of 85%, 86%, 86%, and 86%, respectively. Both proposed models outperform the state-of-the-art deep learning (DL) models. K-fold cross-validation is applied to ensure models' stability at different data splits. Local interpretable model-agnostic explanations and Shapley additive explanations techniques are utilized to enhance interpretability and overcome the traditional black-box nature of DL models. By providing both local and global insights into feature contributions, these explainable artificial intelligence techniques provide transparency to LeDNet and HiDenNet in diabetes prediction. LeDNet and HiDenNet not only improve decision-making transparency but also enhance diabetes prediction accuracy, making them reliable tools for clinical decision-making and early diagnosis.
糖尿病是一种代谢性疾病,如果不进行治疗,可能会导致慢性病和器官衰竭。早期准确检测对于降低这些风险至关重要。预测模型的最新进展显示出了有前景的结果。然而,这些模型准确性不足,难以处理类别不平衡问题,并且缺乏决策过程的可解释性。为了克服这些问题,我们提出了两种用于早期准确糖尿病预测的新型深度模型:LeDNet(受LeNet和双注意力网络启发)和HiDenNet(受高速公路网络和DenseNet影响)。这些模型使用糖尿病健康指标数据集进行训练,该数据集存在固有的类别不平衡问题,会导致有偏差的预测。通过采用多数加权少数过采样技术来缓解这种不平衡。实验结果表明,LeDNet的F1分数为85%,召回率为84%,准确率为85%,精确率为86%。同样,HiDenNet的准确率、F1分数、召回率和精确率分别为85%、86%、86%和86%。所提出的两个模型均优于当前最先进的深度学习(DL)模型。应用K折交叉验证以确保模型在不同数据划分下的稳定性。利用局部可解释模型无关解释和Shapley加法解释技术来增强可解释性,并克服DL模型传统的黑箱性质。通过提供关于特征贡献的局部和全局见解,这些可解释人工智能技术为糖尿病预测中的LeDNet和HiDenNet提供了透明度。LeDNet和HiDenNet不仅提高了决策透明度,还增强了糖尿病预测准确性,使其成为临床决策和早期诊断的可靠工具。