Demirbaş Ayşe Ayyüce, Üzen Hüseyin, Fırat Hüseyin
Ankara, Turkey.
Department of Computer Engineering, Faculty of Engineering, Bingol University, Bingol, Turkey.
Health Inf Sci Syst. 2024 Apr 28;12(1):32. doi: 10.1007/s13755-024-00290-x. eCollection 2024 Dec.
Gastrointestinal (GI) disorders, encompassing conditions like cancer and Crohn's disease, pose a significant threat to public health. Endoscopic examinations have become crucial for diagnosing and treating these disorders efficiently. However, the subjective nature of manual evaluations by gastroenterologists can lead to potential errors in disease classification. In addition, the difficulty of diagnosing diseased tissues in GI and the high similarity between classes made the subject a difficult area. Automated classification systems that use artificial intelligence to solve these problems have gained traction. Automatic detection of diseases in medical images greatly benefits in the diagnosis of diseases and reduces the time of disease detection. In this study, we suggested a new architecture to enable research on computer-assisted diagnosis and automated disease detection in GI diseases. This architecture, called Spatial-Attention ConvMixer (SAC), further developed the patch extraction technique used as the basis of the ConvMixer architecture with a spatial attention mechanism (SAM). The SAM enables the network to concentrate selectively on the most informative areas, assigning importance to each spatial location within the feature maps. We employ the Kvasir dataset to assess the accuracy of classifying GI illnesses using the SAC architecture. We compare our architecture's results with Vanilla ViT, Swin Transformer, ConvMixer, MLPMixer, ResNet50, and SqueezeNet models. Our SAC method gets 93.37% accuracy, while the other architectures get respectively 79.52%, 74.52%, 92.48%, 63.04%, 87.44%, and 85.59%. The proposed spatial attention block improves the accuracy of the ConvMixer architecture on the Kvasir, outperforming the state-of-the-art methods with an accuracy rate of 93.37%.
胃肠道(GI)疾病,包括癌症和克罗恩病等,对公众健康构成重大威胁。内镜检查已成为有效诊断和治疗这些疾病的关键。然而,胃肠病学家进行手动评估的主观性可能导致疾病分类中出现潜在错误。此外,胃肠道中患病组织的诊断难度以及类别之间的高度相似性使得该领域成为一个难题。利用人工智能解决这些问题的自动分类系统已受到关注。医学图像中疾病的自动检测在疾病诊断中大有裨益,并减少了疾病检测时间。在本研究中,我们提出了一种新架构,以促进对胃肠道疾病的计算机辅助诊断和自动疾病检测的研究。这种架构称为空间注意力ConvMixer(SAC),它通过空间注意力机制(SAM)进一步发展了用作ConvMixer架构基础的补丁提取技术。SAM使网络能够有选择地专注于信息最丰富的区域,为特征图内的每个空间位置赋予重要性。我们使用Kvasir数据集来评估使用SAC架构对胃肠道疾病进行分类的准确性。我们将我们架构的结果与香草视觉Transformer(Vanilla ViT)、Swin Transformer、ConvMixer、MLP Mixer、ResNet50和SqueezeNet模型进行比较。我们的SAC方法准确率达到93.37%,而其他架构的准确率分别为79.52%、74.52%、92.48%、63.04%、87.44%和85.59%。所提出的空间注意力模块提高了ConvMixer架构在Kvasir数据集上的准确性,以93.37%的准确率优于当前最先进的方法。