Wu Xing, Tao Chenjie, Li Zhi, Zhang Jian, Sun Qun, Han Xianhua, Chen Yanwei
School of Computer Engineering and Science, Shanghai University, Shanghai 200444, P. R. China.
Shanghai Institute for Advanced Communication and Data Science, Shanghai University, Shanghai 200444, P. R. China.
Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2024 Apr 25;41(2):220-227. doi: 10.7507/1001-5515.202302024.
In computer-aided medical diagnosis, obtaining labeled medical image data is expensive, while there is a high demand for model interpretability. However, most deep learning models currently require a large amount of data and lack interpretability. To address these challenges, this paper proposes a novel data augmentation method for medical image segmentation. The uniqueness and advantages of this method lie in the utilization of gradient-weighted class activation mapping to extract data efficient features, which are then fused with the original image. Subsequently, a new channel weight feature extractor is constructed to learn the weights between different channels. This approach achieves non-destructive data augmentation effects, enhancing the model's performance, data efficiency, and interpretability. Applying the method of this paper to the Hyper-Kvasir dataset, the intersection over union (IoU) and Dice of the U-net were improved, respectively; and on the ISIC-Archive dataset, the IoU and Dice of the DeepLabV3+ were also improved respectively. Furthermore, even when the training data is reduced to 70 %, the proposed method can still achieve performance that is 95 % of that achieved with the entire dataset, indicating its good data efficiency. Moreover, the data-efficient features used in the method have interpretable information built-in, which enhances the interpretability of the model. The method has excellent universality, is plug-and-play, applicable to various segmentation methods, and does not require modification of the network structure, thus it is easy to integrate into existing medical image segmentation method, enhancing the convenience of future research and applications.
在计算机辅助医学诊断中,获取带标注的医学图像数据成本高昂,同时对模型可解释性的需求却很高。然而,当前大多数深度学习模型需要大量数据且缺乏可解释性。为应对这些挑战,本文提出了一种用于医学图像分割的新型数据增强方法。该方法的独特性和优势在于利用梯度加权类激活映射来提取数据有效特征,然后将这些特征与原始图像融合。随后,构建了一个新的通道权重特征提取器来学习不同通道之间的权重。这种方法实现了无损数据增强效果,提升了模型的性能、数据效率和可解释性。将本文方法应用于Hyper-Kvasir数据集时,U-net的交并比(IoU)和Dice系数分别得到了提升;在ISIC-Archive数据集上,DeepLabV3+的IoU和Dice系数也分别得到了提升。此外,即使将训练数据减少到70%,所提方法仍能达到使用整个数据集时95%的性能,表明其具有良好的数据效率。而且,该方法中使用的数据有效特征内置了可解释信息,增强了模型的可解释性。该方法具有出色的通用性,即插即用,适用于各种分割方法,无需修改网络结构,因此易于集成到现有的医学图像分割方法中,提高了未来研究和应用的便利性。