Department of Computer Science and Engineering, Shaoxing University, Shaoxing, 312000, PR China; Cancer Center, Department of Gastroenterology, Zhejiang Provincial People's Hospital (Affiliated People's Hospital, Hangzhou Medical College), Hangzhou, 310014, PR China; Information Technology R&D Innovation Center of Peking University, Shaoxing, 312000, PR China.
Department of Computer Science and Engineering, Shaoxing University, Shaoxing, 312000, PR China.
Comput Biol Med. 2023 Jun;160:107028. doi: 10.1016/j.compbiomed.2023.107028. Epub 2023 May 10.
Colonoscopy is the gold standard method for investigating the gastrointestinal tract. Localizing the polyps in colonoscopy images plays a vital role when doing a colonoscopy screening, and it is also quite important for the following treatment, e.g., polyp resection. Many deep learning-based methods have been applied for solving the polyp segmentation issue. However, precisely polyp segmentation is still an open issue. Considering the effectiveness of the Pyramid Pooling Transformer (P2T) in modeling long-range dependencies and capturing robust contextual features, as well as the power of pyramid pooling in extracting features, we propose a pyramid pooling based network for polyp segmentation, namely PPNet. We first adopt the P2T as the encoder for extracting more powerful features. Next, a pyramid feature fusion module (PFFM) combining the channel attention scheme is utilized for learning a global contextual feature, in order to guide the information transition in the decoder branch. Aiming to enhance the effectiveness of PPNet on feature extraction during the decoder stage layer by layer, we introduce the memory-keeping pyramid pooling module (MPPM) into each side branch of the encoder, and transmit the corresponding feature to each lower-level side branch. Experimental results conducted on five public colorectal polyp segmentation datasets are given and discussed. Our method performs better compared with several state-of-the-art polyp extraction networks, which demonstrate the effectiveness of the mechanism of pyramid pooling for colorectal polyp segmentation.
结肠镜检查是胃肠道检查的金标准方法。在进行结肠镜筛查时,对结肠镜图像中的息肉进行定位起着至关重要的作用,这对于后续的治疗(如息肉切除)也非常重要。许多基于深度学习的方法已经被应用于解决息肉分割问题。然而,精确的息肉分割仍然是一个开放的问题。考虑到 Pyramid Pooling Transformer(P2T)在建模长程依赖关系和捕捉稳健的上下文特征方面的有效性,以及金字塔池化在提取特征方面的优势,我们提出了一种基于金字塔池化的息肉分割网络,即 PPNet。我们首先采用 P2T 作为编码器来提取更强大的特征。接下来,我们利用结合通道注意力机制的金字塔特征融合模块(PFFM)来学习全局上下文特征,以指导解码器分支中的信息传递。为了增强 PPNet 在解码器阶段逐层进行特征提取的有效性,我们在编码器的每个侧支中引入了保持记忆的金字塔池化模块(MPPM),并将相应的特征传输到每个较低级别的侧支。我们在五个公共的结直肠息肉分割数据集上进行了实验,并对结果进行了讨论。与几个先进的息肉提取网络相比,我们的方法表现更好,这证明了金字塔池化在结直肠息肉分割中的机制是有效的。