Institute of Smart Computer Technologies, Riga Technical University, LV-1658 Riga, Latvia.
Institute of Atomic Physics and Spectroscopy, University of Latvia, LV-1586 Riga, Latvia.
Sensors (Basel). 2022 Jan 27;22(3):990. doi: 10.3390/s22030990.
U-Net is the most cited and widely-used deep learning model for biomedical image segmentation. In this paper, we propose a new enhanced version of a ubiquitous U-Net architecture, which improves upon the original one in terms of generalization capabilities, while addressing several immanent shortcomings, such as constrained resolution and non-resilient receptive fields of the main pathway. Our novel multi-path architecture introduces a notion of an individual receptive field pathway, which is merged with other pathways at the bottom-most layer by concatenation and subsequent application of Layer Normalization and Spatial Dropout, which can improve generalization performance for small datasets. In general, our experiments show that the proposed multi-path architecture outperforms other state-of-the-art approaches that embark on similar ideas of pyramid structures, skip-connections, and encoder-decoder pathways. A significant improvement of the Dice similarity coefficient is attained at our proprietary colony-forming unit dataset, where a score of 0.809 was achieved for the foreground class.
U-Net 是最常被引用和广泛应用于生物医学图像分割的深度学习模型。在本文中,我们提出了一种普遍的 U-Net 架构的新增强版本,在泛化能力方面优于原始版本,同时解决了一些内在的缺点,例如主要路径的受限分辨率和非弹性感受野。我们的新的多路径架构引入了单个感受野路径的概念,该概念通过连接在最底层与其他路径合并,并随后应用层归一化和空间丢弃,这可以提高小数据集的泛化性能。总的来说,我们的实验表明,所提出的多路径架构优于其他采用类似金字塔结构、跳过连接和编码器-解码器路径的思想的最新方法。在我们专有的集落形成单位数据集上,我们实现了显著的改进,在前景类中达到了 0.809 的 Dice 相似系数。