一种基于内镜图像的胃肠道疾病分类的新型多特征融合方法

A Novel Multi-Feature Fusion Method for Classification of Gastrointestinal Diseases Using Endoscopy Images.

作者信息

Ramamurthy Karthik, George Timothy Thomas, Shah Yash, Sasidhar Parasa

机构信息

Centre for Cyber Physical Systems, School of Electronics Engineering, Vellore Institute of Technology, Chennai 600127, India.

School of Computer Science and Engineering, Vellore Institute of Technology, Chennai 600127, India.

出版信息

Diagnostics (Basel). 2022 Sep 26;12(10):2316. doi: 10.3390/diagnostics12102316.

DOI:10.3390/diagnostics12102316

PMID:36292006

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9600128/

Abstract

The first step in the diagnosis of gastric abnormalities is the detection of various abnormalities in the human gastrointestinal tract. Manual examination of endoscopy images relies on a medical practitioner's expertise to identify inflammatory regions on the inner surface of the gastrointestinal tract. The length of the alimentary canal and the large volume of images obtained from endoscopic procedures make traditional detection methods time consuming and laborious. Recently, deep learning architectures have achieved better results in the classification of endoscopy images. However, visual similarities between different portions of the gastrointestinal tract pose a challenge for effective disease detection. This work proposes a novel system for the classification of endoscopy images by focusing on feature mining through convolutional neural networks (CNN). The model presented is built by combining a state-of-the-art architecture (i.e., EfficientNet B0) with a custom-built CNN architecture named Effimix. The proposed Effimix model employs a combination of squeeze and excitation layers and self-normalising activation layers for precise classification of gastrointestinal diseases. Experimental observations on the HyperKvasir dataset confirm the effectiveness of the proposed architecture for the classification of endoscopy images. The proposed model yields an accuracy of 97.99%, with an F1 score, precision, and recall of 97%, 97%, and 98%, respectively, which is significantly higher compared to the existing works.

摘要

胃部异常诊断的第一步是检测人体胃肠道中的各种异常情况。内窥镜图像的人工检查依赖于医生的专业知识来识别胃肠道内表面的炎症区域。消化道的长度以及内窥镜检查获得的大量图像使得传统检测方法既耗时又费力。近年来，深度学习架构在内窥镜图像分类方面取得了更好的效果。然而，胃肠道不同部位之间的视觉相似性给有效的疾病检测带来了挑战。这项工作提出了一种新颖的内窥镜图像分类系统，通过卷积神经网络（CNN）专注于特征挖掘。所展示的模型是通过将最先进的架构（即EfficientNet B0）与名为Effimix的定制CNN架构相结合构建而成。所提出的Effimix模型采用挤压和激励层以及自归一化激活层的组合，用于胃肠道疾病的精确分类。在HyperKvasir数据集上的实验观察证实了所提出架构用于内窥镜图像分类的有效性。所提出的模型准确率达到97.99%，F1分数、精确率和召回率分别为97%、97%和98%，与现有工作相比显著更高。