Yi Sanli, Zhang Gang, He Jianfeng
School of Information Engineering and Automation, Kunming University of Scienceand Technology, Kunming 650500, P. R. China.
Key Laboratory of Computer Technology Application of Yunnan Province, Kunming 650500, P. R. China.
Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2022 Oct 25;39(5):937-944. doi: 10.7507/1001-5515.202110073.
Cutaneous malignant melanoma is a common malignant tumor. Accurate segmentation of the lesion area is extremely important for early diagnosis of the disease. In order to achieve more effective and accurate segmentation of skin lesions, a parallel network architecture based on Transformer is proposed in this paper. This network is composed of two parallel branches: the former is the newly constructed multiple residual frequency channel attention network (MFC), and the latter is the visual transformer network (ViT). First, in the MFC network branch, the multiple residual module and the frequency channel attention module (FCA) module are fused to improve the robustness of the network and enhance the capability of extracting image detailed features. Second, in the ViT network branch, multiple head self-attention (MSA) in Transformer is used to preserve the global features of the image. Finally, the feature information extracted from the two branches are combined in parallel to realize image segmentation more effectively. To verify the proposed algorithm, we conducted experiments on the dermoscopy image dataset published by the International Skin Imaging Collaboration (ISIC) in 2018. The results show that the intersection-over-union (IoU) and Dice coefficients of the proposed algorithm achieve 90.15% and 94.82%, respectively, which are better than the latest skin melanoma segmentation networks. Therefore, the proposed network can better segment the lesion area and provide dermatologists with more accurate lesion data.
皮肤恶性黑色素瘤是一种常见的恶性肿瘤。准确分割病变区域对于该疾病的早期诊断极为重要。为了实现对皮肤病变更有效、准确的分割,本文提出了一种基于Transformer的并行网络架构。该网络由两个并行分支组成:前者是新构建的多残差频率通道注意力网络(MFC),后者是视觉Transformer网络(ViT)。首先,在MFC网络分支中,将多残差模块与频率通道注意力模块(FCA)融合,以提高网络的鲁棒性并增强提取图像细节特征的能力。其次,在ViT网络分支中,使用Transformer中的多头自注意力(MSA)来保留图像的全局特征。最后,将从两个分支提取的特征信息进行并行组合,以更有效地实现图像分割。为验证所提算法,我们在国际皮肤影像协作组织(ISIC)2018年发布的皮肤镜图像数据集上进行了实验。结果表明,所提算法的交并比(IoU)和Dice系数分别达到了90.15%和94.82%,优于最新的皮肤黑色素瘤分割网络。因此,所提网络能够更好地分割病变区域,并为皮肤科医生提供更准确的病变数据。