Pan Jiating, Liang Lishi, Sun Peng, Liang Yongbo, Zhu Jianming, Chen Zhencheng
School of Life and Environmental Science, Guilin University of Electronic Technology, Guilin, 541004, China.
School of Electronic Engineering and Automation, Guilin University of Electronic Technology, Guilin, 541004, China.
BMC Med Imaging. 2025 May 27;25(1):193. doi: 10.1186/s12880-025-01725-x.
Lung cancer is a life-threatening disease that poses a significant risk to human health. Accurate differentiation between benign and malignant lung nodules, based on computed tomography (CT), is crucial to assess lung health. Developing an automated computer-aided diagnostic method for this differentiation is essential. We introduced a streamlined 3D model structure to solve the problems of 2D models cannot extract spatial information effectively and 3D models have high complexity and large occupation of computing resources.
We proposed an MSA (multiple self-attention-based) model to address the limitations of 2D models in extracting spatial information effectively and the high complexity associated with 3D models. Our approach introduced the 3D RTConvBlock, which employed multiple self-attention mechanisms for the extraction of spatial features. This enabled the extraction of specific spatial feature information by combining local features, global information, and dependencies between features.
The MSA model demonstrates exceptional performance with an accuracy of 0.953, a sensitivity of 0.963, and an AUC (area under curve) of 0.993 in the LUNA16 dataset, which is higher than state-of-the-art methods. Compared with existing 2D models, we extract spatial information features better, resulting in higher accuracy.
These results have significant implications for enhancing the accuracy and reliability of lung nodule classification, providing robust auxiliary support for physicians diagnosing lung diseases.
肺癌是一种危及生命的疾病,对人类健康构成重大风险。基于计算机断层扫描(CT)准确区分良性和恶性肺结节对于评估肺部健康至关重要。开发一种用于这种区分的自动化计算机辅助诊断方法至关重要。我们引入了一种简化的3D模型结构,以解决2D模型无法有效提取空间信息以及3D模型具有高复杂性和大量计算资源占用的问题。
我们提出了一种基于多自注意力(MSA)的模型,以解决2D模型在有效提取空间信息方面的局限性以及与3D模型相关的高复杂性。我们的方法引入了3D RTConvBlock,它采用多个自注意力机制来提取空间特征。这通过组合局部特征、全局信息和特征之间的依赖性来实现特定空间特征信息的提取。
在LUNA16数据集中,MSA模型表现出卓越的性能,准确率为0.953,灵敏度为0.963,曲线下面积(AUC)为0.993,高于现有最先进的方法。与现有的2D模型相比,我们能更好地提取空间信息特征,从而获得更高的准确率。
这些结果对于提高肺结节分类的准确性和可靠性具有重要意义,为医生诊断肺部疾病提供了有力的辅助支持。