Xiang Shiyu, Wei Lisheng, Hu Kaifeng
School of Electrical EngineeringAnhui Polytechnic University, Wuhu 241000, China.
Anhui Key Laboratory of Electric Drive and Control, Wuhu 241000, China.
J Cancer. 2024 Jan 1;15(1):41-53. doi: 10.7150/jca.88684. eCollection 2024.
To address the problems that the current polyp segmentation model is complicated and the segmentation accuracy needs to be further improved, a lightweight polyp segmentation network model Li-DeepLabV3+ is proposed. Firstly, the optimized MobileNetV2 network is used as the backbone network to reduce the model complexity. Secondly, an improved simple pyramid pooling module is used to replace the original Atrous Spatial Pyramid Pooling structure, which improves the model training efficiency of the model while reducing the model parameters. Finally, to enhance the feature representation, in the feature fusion module, the low-level feature and the high-level feature are fused using the improved Unified Attention Fusion Module, which applies both channel and spatial attention to enrich the fused features, thus obtaining more boundary information. The model was combined with transfer learning for training and validation on the CVC-ClinicDB and Kvasir SEG datasets, and the generalization of the model was verified across the datasets. The experiment results show that the Li-DeepLabV3+ model has superior advantages in segmentation accuracy and segmentation speed, and has certain generalization abilities.
针对当前息肉分割模型复杂且分割精度有待进一步提高的问题,提出了一种轻量级息肉分割网络模型Li-DeepLabV3+。首先,使用优化后的MobileNetV2网络作为骨干网络以降低模型复杂度。其次,采用改进的简单金字塔池化模块替代原始的空洞空间金字塔池化结构,在减少模型参数的同时提高了模型训练效率。最后,为增强特征表示,在特征融合模块中,使用改进的统一注意力融合模块融合低级特征和高级特征,该模块同时应用通道注意力和空间注意力来丰富融合特征,从而获得更多边界信息。该模型结合迁移学习在CVC-ClinicDB和Kvasir SEG数据集上进行训练和验证,并验证了模型在不同数据集上的泛化能力。实验结果表明,Li-DeepLabV3+模型在分割精度和分割速度方面具有显著优势,且具有一定的泛化能力。