Department of Computer Science and Engineering, The Chinese University of Hong Kong, Hong Kong, China.
Centre for Smart Health, School of Nursing, The Hong Kong Polytechnic University, Hong Kong, China.
Med Image Anal. 2017 Oct;41:40-54. doi: 10.1016/j.media.2017.05.001. Epub 2017 May 8.
While deep convolutional neural networks (CNNs) have achieved remarkable success in 2D medical image segmentation, it is still a difficult task for CNNs to segment important organs or structures from 3D medical images owing to several mutually affected challenges, including the complicated anatomical environments in volumetric images, optimization difficulties of 3D networks and inadequacy of training samples. In this paper, we present a novel and efficient 3D fully convolutional network equipped with a 3D deep supervision mechanism to comprehensively address these challenges; we call it 3D DSN. Our proposed 3D DSN is capable of conducting volume-to-volume learning and inference, which can eliminate redundant computations and alleviate the risk of over-fitting on limited training data. More importantly, the 3D deep supervision mechanism can effectively cope with the optimization problem of gradients vanishing or exploding when training a 3D deep model, accelerating the convergence speed and simultaneously improving the discrimination capability. Such a mechanism is developed by deriving an objective function that directly guides the training of both lower and upper layers in the network, so that the adverse effects of unstable gradient changes can be counteracted during the training procedure. We also employ a fully connected conditional random field model as a post-processing step to refine the segmentation results. We have extensively validated the proposed 3D DSN on two typical yet challenging volumetric medical image segmentation tasks: (i) liver segmentation from 3D CT scans and (ii) whole heart and great vessels segmentation from 3D MR images, by participating two grand challenges held in conjunction with MICCAI. We have achieved competitive segmentation results to state-of-the-art approaches in both challenges with a much faster speed, corroborating the effectiveness of our proposed 3D DSN.
虽然深度卷积神经网络(CNNs)在 2D 医学图像分割方面取得了显著的成功,但由于一些相互影响的挑战,如容积图像中复杂的解剖环境、3D 网络的优化困难和训练样本不足,使得 CNN 仍然难以从 3D 医学图像中分割重要的器官或结构。在本文中,我们提出了一种新颖而有效的 3D 全卷积网络,该网络配备了 3D 深度监督机制,以全面解决这些挑战;我们称之为 3D DSN。我们提出的 3D DSN 能够进行体积到体积的学习和推断,这可以消除冗余计算,并减轻在有限的训练数据上过度拟合的风险。更重要的是,3D 深度监督机制可以有效地应对训练 3D 深度模型时梯度消失或爆炸的优化问题,加速收敛速度,同时提高判别能力。该机制通过导出一个直接指导网络中下两层训练的目标函数来实现,从而在训练过程中抵消不稳定梯度变化的不利影响。我们还采用了全连接条件随机场模型作为后处理步骤,以细化分割结果。我们在两个典型但具有挑战性的容积医学图像分割任务上广泛验证了所提出的 3D DSN:(i)从 3D CT 扫描中分割肝脏,(ii)从 3D MR 图像中分割整个心脏和大血管,这两个任务都是在 MICCAI 联合举办的两个大型挑战赛中完成的。我们在这两个挑战中都取得了比最先进方法更有竞争力的分割结果,而且速度更快,这证明了我们提出的 3D DSN 的有效性。