IEEE J Biomed Health Inform. 2017 Jan;21(1):65-75. doi: 10.1109/JBHI.2016.2637004. Epub 2016 Dec 7.
Automated polyp detection in colonoscopy videos has been demonstrated to be a promising way for colorectal cancer prevention and diagnosis. Traditional manual screening is time consuming, operator dependent, and error prone; hence, automated detection approach is highly demanded in clinical practice. However, automated polyp detection is very challenging due to high intraclass variations in polyp size, color, shape, and texture, and low interclass variations between polyps and hard mimics. In this paper, we propose a novel offline and online three-dimensional (3-D) deep learning integration framework by leveraging the 3-D fully convolutional network (3D-FCN) to tackle this challenging problem. Compared with the previous methods employing hand-crafted features or 2-D convolutional neural network, the 3D-FCN is capable of learning more representative spatio-temporal features from colonoscopy videos, and hence has more powerful discrimination capability. More importantly, we propose a novel online learning scheme to deal with the problem of limited training data by harnessing the specific information of an input video in the learning process. We integrate offline and online learning to effectively reduce the number of false positives generated by the offline network and further improve the detection performance. Extensive experiments on the dataset of MICCAI 2015 Challenge on Polyp Detection demonstrated the better performance of our method when compared with other competitors.
结肠镜检查视频中的息肉自动检测已被证明是预防和诊断结直肠癌的一种很有前景的方法。传统的人工筛查耗时、依赖操作人员且容易出错;因此,临床实践中对自动检测方法有很高的需求。然而,由于息肉在大小、颜色、形状和纹理方面存在高度的类内变化,以及息肉与难以区分的类似物之间的类间变化较小,息肉自动检测极具挑战性。在本文中,我们提出了一种新颖的离线和在线三维(3-D)深度学习集成框架,通过利用三维全卷积网络(3D-FCN)来解决这一具有挑战性的问题。与以往采用手工特征或二维卷积神经网络的方法相比,3D-FCN能够从结肠镜检查视频中学习更具代表性的时空特征,因此具有更强的辨别能力。更重要的是,我们提出了一种新颖的在线学习方案,通过在学习过程中利用输入视频的特定信息来处理训练数据有限的问题。我们将离线学习和在线学习相结合,有效减少了离线网络产生的误报数量,并进一步提高了检测性能。在MICCAI 2015息肉检测挑战赛数据集上进行的大量实验表明,与其他竞争对手相比,我们的方法具有更好的性能。