Waheed Zafran, Gui Jinsong, Heyat Md Belal Bin, Parveen Saba, Hayat Mohd Ammar Bin, Iqbal Muhammad Shahid, Aya Zouheir, Nawabi Awais Khan, Sawan Mohamad
School of Computer Science and Engineering, Central South University, China.
School of Electronic Information, Central South University, China.
Comput Methods Programs Biomed. 2025 Mar;260:108579. doi: 10.1016/j.cmpb.2024.108579. Epub 2024 Dec 30.
Gastrointestinal (GI) diseases pose significant challenges for healthcare systems, largely due to the complexities involved in their detection and treatment. Despite the advancements in deep neural networks, their high computational demands hinder their practical use in clinical environments.
This study aims to address the computational inefficiencies of deep neural networks by proposing a lightweight model that integrates model compression techniques, ConvLSTM layers, and ConvNext Blocks, all optimized through Knowledge Distillation (KD).
A dataset of 6000 endoscopic images of various GI diseases was utilized. Advanced image preprocessing techniques, including adaptive noise reduction and image detail enhancement, were employed to improve accuracy and interpretability. The model's performance was assessed in terms of accuracy, computational cost, and disk space usage.
The proposed lightweight model achieved an exceptional overall accuracy of 99.38 %. It operates efficiently with a computational cost of 0.61 GFLOPs and occupies only 3.09 MB of disk space. Additionally, Grad-CAM visualizations demonstrated enhanced model saliency and interpretability, offering insights into the decision-making process of the model post-KD.
The proposed model represents a significant advancement in the diagnosis of GI diseases. It provides a cost-effective and efficient alternative to traditional deep neural network methods, overcoming their computational limitations and contributing valuable insights for improved clinical application.
胃肠道(GI)疾病给医疗系统带来了重大挑战,这主要是由于其检测和治疗过程涉及的复杂性。尽管深度神经网络取得了进展,但其高计算需求阻碍了它们在临床环境中的实际应用。
本研究旨在通过提出一种轻量级模型来解决深度神经网络的计算效率低下问题,该模型集成了模型压缩技术、卷积长短期记忆(ConvLSTM)层和卷积神经网络(ConvNext)模块,所有这些都通过知识蒸馏(KD)进行了优化。
使用了一个包含6000张各种胃肠道疾病内镜图像的数据集。采用了先进的图像预处理技术,包括自适应降噪和图像细节增强,以提高准确性和可解释性。从准确性、计算成本和磁盘空间使用方面评估了该模型的性能。
所提出的轻量级模型实现了99.38%的卓越总体准确率。它以0.61 GFLOPs的计算成本高效运行,仅占用3.09 MB的磁盘空间。此外,梯度加权类激活映射(Grad-CAM)可视化展示了增强的模型显著性和可解释性,为模型在知识蒸馏后的决策过程提供了见解。
所提出的模型代表了胃肠道疾病诊断方面的重大进展。它为传统深度神经网络方法提供了一种经济高效的替代方案,克服了它们的计算限制,并为改进临床应用提供了有价值的见解。