Asan Institute for Life Sciences, Asan Medical Center, 05505 Seoul, Republic of Korea.
Division of Cardiology, University of Ulsan College of Medicine, Asan Medical Center, 05505 Seoul, Republic of Korea.
Neural Netw. 2020 Aug;128:216-233. doi: 10.1016/j.neunet.2020.05.002. Epub 2020 May 19.
In this paper, we proposed nested encoder-decoder architecture named T-Net. T-Net consists of several small encoder-decoders for each block constituting convolutional network. T-Net overcomes the limitation that U-Net can only have a single set of the concatenate layer between encoder and decoder block. To be more precise, the U-Net symmetrically forms the concatenate layers, so the low-level feature of the encoder is connected to the latter part of the decoder, and the high-level feature is connected to the beginning of the decoder. T-Net arranges the pooling and up-sampling appropriately during the encoding process, and likewise during the decoding process so that feature-maps of various sizes are obtained in a single block. As a result, all features from the low-level to the high-level extracted from the encoder are delivered from the beginning of the decoder to predict a more accurate mask. We evaluated T-Net for the problem of segmenting three main vessels in coronary angiography images. The experiment consisted of a comparison of U-Net and T-Nets under the same conditions, and an optimized T-Net for the main vessel segmentation. As a result, T-Net recorded a Dice Similarity Coefficient score (DSC) of 83.77%, 10.69% higher than that of U-Net, and the optimized T-Net recorded a DSC of 88.97% which was 15.89% higher than that of U-Net. In addition, we visualized the weight activation of the convolutional layer of T-Net and U-Net to show that T-Net actually predicts the mask from earlier decoders. Therefore, we expect that T-Net can be effectively applied to other similar medical image segmentation problems.
在本文中,我们提出了一种名为 T-Net 的嵌套编码器-解码器架构。T-Net 由构成卷积网络的每个块的几个小型编码器-解码器组成。T-Net 克服了 U-Net 只能在编码器和解码器块之间具有单个连接层的限制。更准确地说,U-Net 对称地形成连接层,因此编码器的低级别特征连接到解码器的后部分,高级特征连接到解码器的开头。T-Net 在编码过程中适当地进行池化和上采样,在解码过程中也是如此,以便在单个块中获得各种大小的特征图。结果,从编码器中提取的从低级到高级的所有特征都从解码器的开头传递,以预测更准确的掩模。我们评估了 T-Net 在冠状动脉造影图像中分割三个主要血管的问题。该实验包括在相同条件下比较 U-Net 和 T-Nets,以及针对主要血管分割的优化 T-Net。结果,T-Net 的 Dice 相似性系数(DSC)得分为 83.77%,比 U-Net 高 10.69%,优化后的 T-Net 的 DSC 得分为 88.97%,比 U-Net 高 15.89%。此外,我们可视化了 T-Net 和 U-Net 的卷积层的权重激活,以表明 T-Net 实际上从较早的解码器预测掩模。因此,我们期望 T-Net 可以有效地应用于其他类似的医学图像分割问题。