Chitkara University Institute of Engineering and Technology, Chitkara University, Punjab, India.
Model Institute of Engineering and Technology, Jammu, J&K, India.
Math Biosci Eng. 2024 Aug 15;21(8):6847-6869. doi: 10.3934/mbe.2024300.
Convolutional Neural Networks (CNNs) have received substantial attention as a highly effective tool for analyzing medical images, notably in interpreting endoscopic images, due to their capacity to provide results equivalent to or exceeding those of medical specialists. This capability is particularly crucial in the realm of gastrointestinal disorders, where even experienced gastroenterologists find the automatic diagnosis of such conditions using endoscopic pictures to be a challenging endeavor. Currently, gastrointestinal findings in medical diagnosis are primarily determined by manual inspection by competent gastrointestinal endoscopists. This evaluation procedure is labor-intensive, time-consuming, and frequently results in high variability between laboratories. To address these challenges, we introduced a specialized CNN-based architecture called GastroFuse-Net, designed to recognize human gastrointestinal diseases from endoscopic images. GastroFuse-Net was developed by combining features extracted from two different CNN models with different numbers of layers, integrating shallow and deep representations to capture diverse aspects of the abnormalities. The Kvasir dataset was used to thoroughly test the proposed deep learning model. This dataset contained images that were classified according to structures (cecum, z-line, pylorus), diseases (ulcerative colitis, esophagitis, polyps), or surgical operations (dyed resection margins, dyed lifted polyps). The proposed model was evaluated using various measures, including specificity, recall, precision, F1-score, Mathew's Correlation Coefficient (MCC), and accuracy. The proposed model GastroFuse-Net exhibited exceptional performance, achieving a precision of 0.985, recall of 0.985, specificity of 0.984, F1-score of 0.997, MCC of 0.982, and an accuracy of 98.5%.
卷积神经网络 (CNN) 作为一种分析医学图像的高效工具,受到了广泛关注,特别是在解释内窥镜图像方面,因为它们能够提供与医学专家相当或更高的结果。这种能力在胃肠道疾病领域尤为重要,即使是经验丰富的胃肠病学家,使用内窥镜图像自动诊断这些疾病也具有挑战性。目前,医学诊断中的胃肠道发现主要是由有能力的胃肠内窥镜医生进行手动检查来确定。这种评估过程既劳动密集又耗时,并且经常导致实验室之间存在高度的可变性。为了解决这些挑战,我们引入了一种专门的基于 CNN 的架构,称为 GastroFuse-Net,用于从内窥镜图像中识别人类胃肠道疾病。GastroFuse-Net 是通过结合具有不同层数的两个不同 CNN 模型提取的特征构建的,将浅层和深层表示集成在一起,以捕获异常的各个方面。该深度学习模型在 Kvasir 数据集上进行了全面测试。该数据集包含根据结构(盲肠、Z 线、幽门)、疾病(溃疡性结肠炎、食管炎、息肉)或手术(染色切除边缘、染色提起的息肉)分类的图像。使用各种指标(包括特异性、召回率、精度、F1 评分、马修斯相关系数 (MCC) 和准确性)评估所提出的模型。所提出的模型 GastroFuse-Net 表现出色,达到了 0.985 的精度、0.985 的召回率、0.984 的特异性、0.997 的 F1 评分、0.982 的 MCC 和 98.5%的准确率。