Liu Ming, Jiang Jue, Wang Zenan
Hunan Key Laboratory of Nonferrous Resources and Geological Hazard Exploration, Changsha 410083, China.
Department of Medical Physics, Memorial Sloan Kettering Cancer Center, New York, NY 10065, USA.
IEEE Access. 2019;7:75058-75066. doi: 10.1109/access.2019.2921027. Epub 2019 Jun 5.
A major rise in the prevalence and influence of colorectal cancer (CRC) leads to substantially increasing healthcare costs and even death. It is widely accepted that early detection and removal of colonic polyps can prevent CRC. Detection of colonic polyps in colonoscopy videos is problematic because of complex environment of colon and various shapes of polyps. Currently, researchers indicate feasibility of Convolutional Neural Network (CNN)-based detection of polyps but better feature extractors are needed to improve detection performance. In this paper, we investigated the potential of the single shot detector (SSD) framework for detecting polyps in colonoscopy videos. SSD is a one-stage method, which uses a feed-forward CNN to produce a collection of fixed-size bounding boxes for each object from different feature maps. Three different feature extractors, including ResNet50, VGG16, and InceptionV3 were assessed. Multi-scale feature maps integrated into SSD were designed for ResNet50 and InceptionV3, respectively. We validated this method on the 2015 MICCAI polyp detection challenge datasets, compared it with teams attended the challenge, YOLOV3 and two-stage method, Faster-RCNN. Our results demonstrated that the proposed method surpassed all the teams in MICCAI challenge and YOLOV3 and was comparable with two-stage method. Especially in detection speed aspect, our proposed method outperformed all the methods, met real-time application requirement. Meanwhile, we also indicated that among all the feature extractors, InceptionV3 obtained the best result of precision and recall. In conclusion, SSD- based method achieved excellent detection performance in polyp detection and can potentially improve diagnostic accuracy and efficiency.
结直肠癌(CRC)患病率和影响力的大幅上升导致医疗成本大幅增加,甚至造成死亡。人们普遍认为,早期发现并切除结肠息肉可预防结直肠癌。由于结肠环境复杂且息肉形状各异,在结肠镜检查视频中检测结肠息肉存在问题。目前,研究人员指出基于卷积神经网络(CNN)检测息肉具有可行性,但需要更好的特征提取器来提高检测性能。在本文中,我们研究了单阶段检测器(SSD)框架在结肠镜检查视频中检测息肉的潜力。SSD是一种单阶段方法,它使用前馈卷积神经网络从不同特征图为每个对象生成一组固定大小的边界框。我们评估了三种不同的特征提取器,包括ResNet50、VGG16和InceptionV3。分别为ResNet50和InceptionV3设计了集成到SSD中的多尺度特征图。我们在2015年医学图像计算与计算机辅助干预国际会议(MICCAI)息肉检测挑战赛数据集上验证了该方法,并将其与参加挑战赛的团队、YOLOV3和两阶段方法Faster-RCNN进行了比较。我们的结果表明,所提出的方法在MICCAI挑战赛中超过了所有团队以及YOLOV3,并且与两阶段方法相当。特别是在检测速度方面,我们提出的方法优于所有方法,满足实时应用需求。同时,我们还指出,在所有特征提取器中,InceptionV3在精度和召回率方面取得了最佳结果。总之,基于SSD的方法在息肉检测中取得了优异的检测性能,并有可能提高诊断准确性和效率。