Department of Electrical and Electronic EngineeringBangladesh University of Engineering and Technology Dhaka 1000 Bangladesh.
Department of Electrical EngineeringThe University of Texas at Dallas Richardson TX 75080 USA.
IEEE J Transl Eng Health Med. 2022 Aug 16;10:3300108. doi: 10.1109/JTEHM.2022.3198819. eCollection 2022.
: The emergence of wireless capsule endoscopy (WCE) has presented a viable non-invasive mean of identifying gastrointestinal diseases in the field of clinical gastroenterology. However, to overcome its extended time of manual inspection, a computer aided automatic detection system is getting vast popularity. In this case, major challenges are low resolution and lack of regional context in images extracted from WCE videos. : For tackling these challenges, in this paper a convolution neural network (CNN) based architecture, namely RAt-CapsNet, is proposed that reliably employs regional information and attention mechanism to classify abnormalities from WCE video data. The proposed RAt-CapsNet consists of two major pipelines: Compression Pipeline and Regional Correlative Pipeline. In the compression pipeline, an encoder module is designed using a Volumetric Attention Mechanism which provides 3D enhancement to feature maps using spatial domain condensation as well as channel-wise filtering for preserving relevant structural information of images. On the other hand, the regional correlative pipeline consists of Pyramid Feature Extractor which operates on image driven feature vectors to generalize and propagate local relationships of pixels from WCE abnormalities with respect to the normal healthy surrounding. The feature vectors generated by the pipelines are then accumulated to formulate a classification standpoint. : Promising computational accuracy of mean 98.51% in binary class and over 95.65% in multi-class are obtained through extensive experimentation on a highly unbalanced public dataset with over 47 thousand labelled. : This outcome in turn supports the efficacy of the proposed methodology as a noteworthy WCE abnormality detection as well as diagnostic system.
无线胶囊内镜 (WCE) 的出现为临床胃肠病学领域的胃肠道疾病提供了一种可行的非侵入性诊断方法。然而,为了克服其手动检查时间过长的问题,计算机辅助自动检测系统越来越受到欢迎。在这种情况下,主要的挑战是从 WCE 视频中提取的图像分辨率低且缺乏区域上下文。
为了解决这些挑战,本文提出了一种基于卷积神经网络 (CNN) 的架构,即 RAt-CapsNet,该架构可靠地利用区域信息和注意力机制,从 WCE 视频数据中分类异常。所提出的 RAt-CapsNet 由两个主要的流水线组成:压缩流水线和区域相关流水线。在压缩流水线中,设计了一个使用体积注意力机制的编码器模块,该模块通过空间域凝聚和通道滤波为特征图提供 3D 增强,以保留图像的相关结构信息。另一方面,区域相关流水线由金字塔特征提取器组成,该提取器基于图像驱动的特征向量操作,以概括和传播 WCE 异常相对于正常健康周围的像素的局部关系。然后将流水线生成的特征向量累积起来,形成一个分类观点。
通过在一个高度不平衡的公共数据集上进行广泛的实验,该数据集包含超过 47000 个标记,获得了平均 98.51%的二进制类和超过 95.65%的多类的有希望的计算准确性。
这一结果反过来支持了该方法作为一种有价值的 WCE 异常检测和诊断系统的有效性。