Xiang Tiange, Zhang Chaoyi, Wang Xinyi, Song Yang, Liu Dongnan, Huang Heng, Cai Weidong
School of Computer Science, University of Sydney, Australia.
School of Computer Science, University of Sydney, Australia.
Med Image Anal. 2022 May;78:102420. doi: 10.1016/j.media.2022.102420. Epub 2022 Mar 16.
U-Net, as an encoder-decoder architecture with forward skip connections, has achieved promising results in various medical image analysis tasks. Many recent approaches have also extended U-Net with more complex building blocks, which typically increase the number of network parameters considerably. Such complexity makes the inference stage highly inefficient for clinical applications. Towards an effective yet economic segmentation network design, in this work, we propose backward skip connections that bring decoded features back to the encoder. Our design can be jointly adopted with forward skip connections in any encoder-decoder architecture forming a recurrence structure without introducing extra parameters. With the backward skip connections, we propose a U-Net based network family, namely Bi-directional O-shape networks, which set new benchmarks on multiple public medical imaging segmentation datasets. On the other hand, with the most plain architecture (BiO-Net), network computations inevitably increase along with the pre-set recurrence time. We have thus studied the deficiency bottleneck of such recurrent design and propose a novel two-phase Neural Architecture Search (NAS) algorithm, namely BiX-NAS, to search for the best multi-scale bi-directional skip connections. The ineffective skip connections are then discarded to reduce computational costs and speed up network inference. The finally searched BiX-Net yields the least network complexity and outperforms other state-of-the-art counterparts by large margins. We evaluate our methods on both 2D and 3D segmentation tasks in a total of six datasets. Extensive ablation studies have also been conducted to provide a comprehensive analysis for our proposed methods.
U-Net作为一种具有前向跳跃连接的编码器-解码器架构,在各种医学图像分析任务中取得了不错的成果。最近的许多方法还使用更复杂的构建块对U-Net进行了扩展,这通常会大幅增加网络参数的数量。这种复杂性使得推理阶段在临床应用中效率极低。为了设计一个高效且经济的分割网络,在这项工作中,我们提出了反向跳跃连接,将解码后的特征带回编码器。我们的设计可以与任何编码器-解码器架构中的前向跳跃连接联合采用,形成一种循环结构,而无需引入额外参数。借助反向跳跃连接,我们提出了一个基于U-Net的网络家族,即双向O形网络,它在多个公共医学影像分割数据集上设定了新的基准。另一方面,对于最基本的架构(BiO-Net),网络计算不可避免地会随着预设的循环次数增加。因此,我们研究了这种循环设计的缺陷瓶颈,并提出了一种新颖的两阶段神经架构搜索(NAS)算法,即BiX-NAS,以搜索最佳的多尺度双向跳跃连接。然后丢弃无效的跳跃连接,以降低计算成本并加快网络推理速度。最终搜索得到的BiX-Net具有最低的网络复杂度,并大幅优于其他现有同类方法。我们在总共六个数据集中的2D和3D分割任务上评估了我们的方法。还进行了广泛的消融研究,以对我们提出的方法进行全面分析。