Yan Tao, Qin Ye Ying, Wong Pak Kin, Ren Hao, Wong Chi Hong, Yao Liang, Hu Ying, Chan Cheok I, Gao Shan, Chan Pui Pun
School of Mechanical Engineering, Hubei University of Arts and Science, Xiangyang 441053, China.
Department of Electromechanical Engineering, University of Macau, Taipa, Macau 999078, China.
Bioengineering (Basel). 2023 Jul 5;10(7):806. doi: 10.3390/bioengineering10070806.
Convolutional neural networks (CNNs) have received increased attention in endoscopic images due to their outstanding advantages. Clinically, some gastric polyps are related to gastric cancer, and accurate identification and timely removal are critical. CNN-based semantic segmentation can delineate each polyp region precisely, which is beneficial to endoscopists in the diagnosis and treatment of gastric polyps. At present, just a few studies have used CNN to automatically diagnose gastric polyps, and studies on their semantic segmentation are lacking. Therefore, we contribute pioneering research on gastric polyp segmentation in endoscopic images based on CNN. Seven classical semantic segmentation models, including U-Net, UNet++, DeepLabv3, DeepLabv3+, Pyramid Attention Network (PAN), LinkNet, and Muti-scale Attention Net (MA-Net), with the encoders of ResNet50, MobineNetV2, or EfficientNet-B1, are constructed and compared based on the collected dataset. The integrated evaluation approach to ascertaining the optimal CNN model combining both subjective considerations and objective information is proposed since the selection from several CNN models is difficult in a complex problem with conflicting multiple criteria. UNet++ with the MobineNet v2 encoder obtains the best scores in the proposed integrated evaluation method and is selected to build the automated polyp-segmentation system. This study discovered that the semantic segmentation model has a high clinical value in the diagnosis of gastric polyps, and the integrated evaluation approach can provide an impartial and objective tool for the selection of numerous models. Our study can further advance the development of endoscopic gastrointestinal disease identification techniques, and the proposed evaluation technique has implications for mathematical model-based selection methods for clinical technologies.
卷积神经网络(CNN)因其突出优势在内窥镜图像中受到越来越多的关注。临床上,一些胃息肉与胃癌相关,准确识别并及时切除至关重要。基于CNN的语义分割能够精确勾勒出每个息肉区域,这有助于内镜医师对胃息肉进行诊断和治疗。目前,仅有少数研究使用CNN自动诊断胃息肉,且缺乏对其语义分割的研究。因此,我们开展了基于CNN的内镜图像胃息肉分割的开创性研究。基于收集到的数据集,构建并比较了七种经典语义分割模型,包括U-Net、UNet++、DeepLabv3、DeepLabv3+、金字塔注意力网络(PAN)、LinkNet和多尺度注意力网络(MA-Net),其编码器采用ResNet50、MobineNetV2或EfficientNet-B1。由于在具有多个相互冲突标准的复杂问题中,从多个CNN模型中进行选择较为困难,因此提出了一种综合评估方法来确定结合主观考量和客观信息的最优CNN模型。在提出的综合评估方法中,带有MobineNet v2编码器的UNet++获得了最佳分数,并被选来构建自动息肉分割系统。本研究发现语义分割模型在胃息肉诊断中具有较高的临床价值,且综合评估方法可为众多模型的选择提供公正客观的工具。我们的研究可进一步推动内镜下胃肠疾病识别技术的发展,且所提出的评估技术对基于数学模型的临床技术选择方法具有启示意义。