Hu Tianjiao, Lan Yihua, Zhang Yingqi, Xu Jiashu, Li Shuai, Hung Chih-Cheng
School of Artificial Intelligence and Software Engineering, Nanyang Normal University, Nanyang, 473061, China.
Henan Engineering Research Center of Intelligent Processing for Big Data of Digital Image, Nanyang, 473061, China.
Sci Rep. 2024 Dec 30;14(1):31743. doi: 10.1038/s41598-024-82877-8.
Accurate lung nodule segmentation is fundamental for the early detection of lung cancer. With the rapid development of deep learning, lung nodule segmentation models based on the encoder-decoder structure have become the mainstream research approach. However, during the encoding process, most models have limitations in extracting edge and semantic information and in capturing long-range dependencies. To address these problems, we propose a new lung nodule segmentation model, abbreviated as MCAT-Net. In this model, we construct a multi-threshold feature separation module to capture edge and texture features from different levels and specified intensities of the input image. Secondly, we introduce the coordinate attention mechanism, which allows the model to better recognize and utilize spatial information when handling long-range dependencies, enabling the deep network to maintain its sensitivity to nodule positions. Thirdly, we use the transformer to fully capture the long-range dependencies, further enhancing the global information integration of the network. The proposed method was verified on the LIDC-IDRI and LNDb datasets. The Dice similarity coefficient (DSC) values achieved were 88.29% and 78.51%, and the sensitivities were 86.33% and 75.05%, respectively. The experimental results demonstrated its high practical value for the early diagnosis of lung cancer.
准确的肺结节分割是肺癌早期检测的基础。随着深度学习的快速发展,基于编码器-解码器结构的肺结节分割模型已成为主流研究方法。然而,在编码过程中,大多数模型在提取边缘和语义信息以及捕捉长程依赖方面存在局限性。为了解决这些问题,我们提出了一种新的肺结节分割模型,简称为MCAT-Net。在该模型中,我们构建了一个多阈值特征分离模块,以从输入图像的不同级别和指定强度中捕捉边缘和纹理特征。其次,我们引入了坐标注意力机制,这使得模型在处理长程依赖时能够更好地识别和利用空间信息,使深度网络能够保持对结节位置的敏感性。第三,我们使用Transformer来充分捕捉长程依赖,进一步增强网络的全局信息整合能力。所提出的方法在LIDC-IDRI和LNDb数据集上得到了验证。获得的Dice相似系数(DSC)值分别为88.29%和78.51%,灵敏度分别为86.33%和75.05%。实验结果证明了其在肺癌早期诊断中的高实用价值。