Kuang Zhuo, Yan Zengqiang, Zhou Huiyu, Yu Li
IEEE J Biomed Health Inform. 2023 Oct;27(10):4890-4901. doi: 10.1109/JBHI.2023.3300179. Epub 2023 Oct 5.
Weakly supervised learning, releasing deep learning from highly labor-intensive pixel-wise annotations, has gained great attention, especially for medical image segmentation. With only image-level labels, pixel-wise segmentation/localization usually is achieved based on class activation maps (CAMs) containing the most discriminative regions. One common consequence of CAM-based approaches is incomplete foreground segmentation, i.e. under-segmentation/false negatives. Meanwhile, suffering from relatively limited medical imaging data, class-irrelevant tissues can hardly be suppressed during classification, resulting in incorrect background identification, i.e. over-segmentation/false positives. The above two issues are determined by the loose-constraint nature of image-level labels penalizing on the entire image space, and thus how to develop pixel-wise constraints based on image-level labels is the key for performance improvement which is under-explored. In this paper, based on unsupervised clustering, we propose a new paradigm called cluster-re-supervision to evaluate the contribution of each pixel in CAMs to final classification and thus generate pixel-wise supervision (i.e., clustering maps) for CAMs refinement on both over- and under-segmentation reduction. Furthermore, based on self-supervised learning, an inter-modality image reconstruction module, together with random masking, is designed to complement local information in feature learning which helps stabilize clustering. Experimental results on two popular public datasets demonstrate the superior performance of the proposed weakly-supervised framework for medical image segmentation. More importantly, cluster-re-supervision is independent of specific tasks and highly extendable to other applications.
弱监督学习将深度学习从高度劳动密集型的逐像素标注中解放出来,受到了广泛关注,尤其是在医学图像分割领域。仅利用图像级标签,逐像素分割/定位通常基于包含最具判别力区域的类激活映射(CAM)来实现。基于CAM的方法的一个常见后果是前景分割不完整,即分割不足/假阴性。同时,由于医学成像数据相对有限,在分类过程中几乎无法抑制与类别无关的组织,从而导致背景识别错误,即过度分割/假阳性。上述两个问题是由图像级标签在整个图像空间上惩罚的宽松约束性质决定的,因此如何基于图像级标签开发逐像素约束是性能提升的关键,而这一点尚未得到充分探索。在本文中,基于无监督聚类,我们提出了一种名为聚类再监督的新范式,以评估CAM中每个像素对最终分类的贡献,从而生成逐像素监督(即聚类映射),用于在减少过度分割和分割不足方面对CAM进行细化。此外,基于自监督学习,设计了一个跨模态图像重建模块,并结合随机掩蔽,以补充特征学习中的局部信息,这有助于稳定聚类。在两个流行的公共数据集上的实验结果证明了所提出的弱监督医学图像分割框架的优越性能。更重要的是,聚类再监督独立于特定任务,并且具有高度可扩展性,可应用于其他领域。