Department of Computing Science, University of Aberdeen, Aberdeen, AB24 3UE, United Kingdom.
Department of Computing Science, University of Aberdeen, Aberdeen, AB24 3UE, United Kingdom.
Comput Biol Med. 2024 Oct;181:109047. doi: 10.1016/j.compbiomed.2024.109047. Epub 2024 Aug 24.
The performance of existing lesion semantic segmentation models has shown a steady improvement with the introduction of mechanisms like attention, skip connections, and deep supervision. However, these advancements often come at the expense of computational requirements, necessitating powerful graphics processing units with substantial video memory. Consequently, certain models may exhibit poor or non-existent performance on more affordable edge devices, such as smartphones and other point-of-care devices. To tackle this challenge, our paper introduces a lesion segmentation model with a low parameter count and minimal operations. This model incorporates polar transformations to simplify images, facilitating faster training and improved performance. We leverage the characteristics of polar images by directing the model's focus to areas most likely to contain segmentation information, achieved through the introduction of a learning-efficient polar-based contrast attention (PCA). This design utilizes Hadamard products to implement a lightweight attention mechanism without significantly increasing model parameters and complexities. Furthermore, we present a novel skip cross-channel aggregation (SCA) approach for sharing cross-channel corrections, introducing Gaussian depthwise convolution to enhance nonlinearity. Extensive experiments on the ISIC 2018 and Kvasir datasets demonstrate that our model surpasses state-of-the-art models while maintaining only about 25K parameters. Additionally, our proposed model exhibits strong generalization to cross-domain data, as confirmed through experiments on the PH dataset and CVC-Polyp dataset. In addition, we evaluate the model's performance in a mobile setting against other lightweight models. Notably, our proposed model outperforms other advanced models in terms of IoU and Dice score, and running time.
现有的病变语义分割模型的性能随着注意力机制、跳连接和深度监督等机制的引入而稳步提高。然而,这些改进通常是以计算要求为代价的,需要具有大量视频内存的强大图形处理单元。因此,某些模型在更经济实惠的边缘设备(如智能手机和其他即时护理设备)上可能表现不佳或根本无法运行。为了解决这个挑战,我们的论文提出了一个参数少、操作少的病变分割模型。该模型采用极坐标变换来简化图像,从而加快训练速度并提高性能。我们利用极坐标图像的特点,通过引入学习效率高的基于极坐标的对比注意力(PCA),引导模型的注意力集中在最有可能包含分割信息的区域。这种设计利用哈达玛积来实现轻量级注意力机制,而不会显著增加模型参数和复杂度。此外,我们提出了一种新颖的跨通道聚合(SCA)方法来共享跨通道校正,引入高斯深度卷积以增强非线性。在 ISIC 2018 和 Kvasir 数据集上的广泛实验表明,我们的模型在保持约 25K 参数的情况下,超越了最先进的模型。此外,我们提出的模型在跨域数据上具有很强的泛化能力,这在 PH 数据集和 CVC-Polyp 数据集上的实验中得到了证实。此外,我们还在移动环境中评估了模型与其他轻量级模型的性能。值得注意的是,我们提出的模型在 IoU 和 Dice 得分以及运行时间方面均优于其他先进模型。