Hu Haigen, Shen Leizhao, Guan Qiu, Li Xiaoxin, Zhou Qianwei, Ruan Su
College of Computer Science and Technology, Zhejiang University of Technology, Hangzhou 310023, PR China.
Key Laboratory of Visual Media Intelligent Processing Technology of Zhejiang Province, Hangzhou 310023, PR China.
Pattern Recognit. 2022 Apr;124:108452. doi: 10.1016/j.patcog.2021.108452. Epub 2021 Nov 25.
Due to the irregular shapes,various sizes and indistinguishable boundaries between the normal and infected tissues, it is still a challenging task to accurately segment the infected lesions of COVID-19 on CT images. In this paper, a novel segmentation scheme is proposed for the infections of COVID-19 by enhancing supervised information and fusing multi-scale feature maps of different levels based on the encoder-decoder architecture. To this end, a deep collaborative supervision (Co-supervision) scheme is proposed to guide the network learning the features of edges and semantics. More specifically, an Edge Supervised Module (ESM) is firstly designed to highlight low-level boundary features by incorporating the edge supervised information into the initial stage of down-sampling. Meanwhile, an Auxiliary Semantic Supervised Module (ASSM) is proposed to strengthen high-level semantic information by integrating mask supervised information into the later stage. Then an Attention Fusion Module (AFM) is developed to fuse multiple scale feature maps of different levels by using an attention mechanism to reduce the semantic gaps between high-level and low-level feature maps. Finally, the effectiveness of the proposed scheme is demonstrated on four various COVID-19 CT datasets. The results show that the proposed three modules are all promising. Based on the baseline (ResUnet), using ESM, ASSM, or AFM alone can respectively increase Dice metric by 1.12%, 1.95%,1.63% in our dataset, while the integration by incorporating three models together can rise 3.97%. Compared with the existing approaches in various datasets, the proposed method can obtain better segmentation performance in some main metrics, and can achieve the best generalization and comprehensive performance.
由于正常组织与感染组织形状不规则、大小各异且边界难以区分,在CT图像上准确分割新型冠状病毒肺炎(COVID-19)的感染病灶仍是一项具有挑战性的任务。本文基于编码器-解码器架构,通过增强监督信息和融合不同层次的多尺度特征图,提出了一种针对COVID-19感染的新型分割方案。为此,提出了一种深度协同监督(Co-supervision)方案来指导网络学习边缘和语义特征。具体而言,首先设计了一个边缘监督模块(ESM),通过将边缘监督信息纳入下采样的初始阶段来突出低级边界特征。同时,提出了一个辅助语义监督模块(ASSM),通过将掩码监督信息整合到后期阶段来强化高级语义信息。然后开发了一个注意力融合模块(AFM),利用注意力机制融合不同层次的多尺度特征图,以减少高级和低级特征图之间的语义差距。最后,在四个不同的COVID-19 CT数据集中验证了所提方案的有效性。结果表明,所提出的三个模块都很有前景。基于基线(ResUnet),在我们的数据集中单独使用ESM、ASSM或AFM分别可使Dice指标提高1.12%、1.95%、1.63%,而将三个模型结合使用可提高3.97%。与各种数据集中的现有方法相比,所提方法在一些主要指标上能获得更好的分割性能,并且能实现最佳的泛化和综合性能。