Zhao Xin, Wang Wenqi
College of Information Engineering, Dalian University, Dalian 116622, China.
J Imaging. 2024 May 14;10(5):118. doi: 10.3390/jimaging10050118.
In the realm of medical image analysis, the cost associated with acquiring accurately labeled data is prohibitively high. To address the issue of label scarcity, semi-supervised learning methods are employed, utilizing unlabeled data alongside a limited set of labeled data. This paper presents a novel semi-supervised medical segmentation framework, DCCLNet (deep consistency collaborative learning UNet), grounded in deep consistent co-learning. The framework synergistically integrates consistency learning from feature and input perturbations, coupled with collaborative training between CNN (convolutional neural networks) and ViT (vision transformer), to capitalize on the learning advantages offered by these two distinct paradigms. Feature perturbation involves the application of auxiliary decoders with varied feature disturbances to the main CNN backbone, enhancing the robustness of the CNN backbone through consistency constraints generated by the auxiliary and main decoders. Input perturbation employs an MT (mean teacher) architecture wherein the main network serves as the student model guided by a teacher model subjected to input perturbations. Collaborative training aims to improve the accuracy of the main networks by encouraging mutual learning between the CNN and ViT. Experiments conducted on publicly available datasets for ACDC (automated cardiac diagnosis challenge) and Prostate datasets yielded Dice coefficients of 0.890 and 0.812, respectively. Additionally, comprehensive ablation studies were performed to demonstrate the effectiveness of each methodological contribution in this study.
在医学图像分析领域,获取准确标注数据的成本高得令人望而却步。为了解决标签稀缺的问题,人们采用半监督学习方法,将未标注数据与有限的已标注数据一起使用。本文提出了一种新颖的半监督医学分割框架DCCLNet(深度一致性协作学习U-Net),其基于深度一致性协同学习。该框架将来自特征和输入扰动的一致性学习协同整合,再加上卷积神经网络(CNN)和视觉Transformer(ViT)之间的协作训练,以利用这两种不同范式提供的学习优势。特征扰动包括将具有不同特征干扰的辅助解码器应用于主CNN主干,通过辅助解码器和主解码器生成的一致性约束增强CNN主干的鲁棒性。输入扰动采用均值教师(MT)架构,其中主网络作为学生模型,由受输入扰动的教师模型引导。协作训练旨在通过鼓励CNN和ViT之间的相互学习来提高主网络的准确性。在用于自动心脏诊断挑战(ACDC)和前列腺数据集的公开可用数据集上进行的实验分别产生了0.890和0.812的Dice系数。此外,还进行了全面的消融研究,以证明本研究中每种方法贡献的有效性。