School of Artificial Intelligence, Beijing University of Posts and Telecommunications, Beijing, 100876, China.
School of Artificial Intelligence, Beijing University of Posts and Telecommunications, Beijing, 100876, China.
Med Image Anal. 2024 Oct;97:103302. doi: 10.1016/j.media.2024.103302. Epub 2024 Aug 10.
Semi-supervised medical image segmentation (SSMIS) has witnessed substantial advancements by leveraging limited labeled data and abundant unlabeled data. Nevertheless, existing state-of-the-art (SOTA) methods encounter challenges in accurately predicting labels for the unlabeled data, giving rise to disruptive noise during training and susceptibility to erroneous information overfitting. Moreover, applying perturbations to inaccurate predictions further impedes consistent learning. To address these concerns, we propose a novel cross-head mutual mean-teaching network (CMMT-Net) incorporated weak-strong data augmentations, thereby benefiting both co-training and consistency learning. More concretely, our CMMT-Net extends the cross-head co-training paradigm by introducing two auxiliary mean teacher models, which yield more accurate predictions and provide supplementary supervision. The predictions derived from weakly augmented samples generated by one mean teacher are leveraged to guide the training of another student with strongly augmented samples. Furthermore, two distinct yet synergistic data perturbations at the pixel and region levels are introduced. We propose mutual virtual adversarial training (MVAT) to smooth the decision boundary and enhance feature representations, and a cross-set CutMix strategy to generate more diverse training samples for capturing inherent structural data information. Notably, CMMT-Net simultaneously implements data, feature, and network perturbations, amplifying model diversity and generalization performance. Experimental results on three publicly available datasets indicate that our approach yields remarkable improvements over previous SOTA methods across various semi-supervised scenarios. The code is available at https://github.com/Leesoon1984/CMMT-Net.
半监督医学图像分割 (SSMIS) 通过利用有限的标记数据和大量的未标记数据取得了重大进展。然而,现有的最先进 (SOTA) 方法在准确预测未标记数据的标签方面遇到了挑战,在训练过程中产生了破坏性的噪声,并容易受到错误信息的过度拟合。此外,对不准确的预测进行扰动进一步阻碍了一致的学习。为了解决这些问题,我们提出了一种新的跨头互均值教师网络 (CMMT-Net),结合了弱强数据增强,从而有利于共同训练和一致性学习。更具体地说,我们的 CMMT-Net 通过引入两个辅助均值教师模型扩展了跨头协同训练范例,从而产生更准确的预测并提供补充监督。一个均值教师生成的弱增强样本的预测被用来指导另一个具有强增强样本的学生的训练。此外,引入了两种不同但协同的像素级和区域级数据扰动。我们提出了互虚拟对抗训练 (MVAT) 来平滑决策边界并增强特征表示,以及一种跨集 CutMix 策略来生成更多多样化的训练样本,以捕捉内在的结构数据信息。值得注意的是,CMMT-Net 同时实现了数据、特征和网络的扰动,增强了模型的多样性和泛化性能。在三个公开可用的数据集上的实验结果表明,我们的方法在各种半监督场景下都优于以前的 SOTA 方法。代码可在 https://github.com/Leesoon1984/CMMT-Net 获得。