Yang Yang, Li Xiaoqin, Fu Jipeng, Han Zhenbo, Gao Bin
Faculty of Environment and Life, Beijing University of Technology, Beijing, China.
Med Phys. 2023 Mar;50(3):1905-1916. doi: 10.1002/mp.16221. Epub 2023 Feb 1.
Early screening is crucial to improve the survival rate and recovery rate of lung cancer patients. Computer-aided diagnosis system (CAD) is a powerful tool to assist clinicians in early diagnosis. Lung nodules are characterized by spatial heterogeneity. However, many attempts use the two-dimensional multi-view (MV) framework to learn and simply integrate multiple view features. These methods suffer from the problems of not capturing the spatial characteristics effectively and ignoring the variability of multiple views. In this paper, we propose a three-dimensional MV convolutional neural network (3D MVCNN) framework and embed the squeeze-and-excitation (SE) module in it to further address the variability of each view in the MV framework.
First, the 3D multiple view samples of lung nodules are extracted by the spatial sampling method, and a 3D CNN is established to extract 3D abstract features. Second, build a 3D MVCNN framework according to the 3D multiple view samples and 3D CNN. This framework can learn more features of different views of lung nodules, taking into account the characteristics of spatial heterogeneity of lung nodules. Finally, to further address the variability of each view in the MV framework, a 3D MVSECNN model is constructed by introducing a SE module in the feature fusion stage. For training and testing purposes we used independent subsets of the public LIDC-IDRI dataset.
For the LIDC-IDRI dataset, this study achieved 96.04% accuracy and 98.59% sensitivity in the binary classification, and 87.76% accuracy in the ternary classification, which was higher than other state-of-the-art studies. The consistency score of 0.948 between the model predictions and pathological diagnosis was significantly higher than that between the clinician's annotations and pathological diagnosis.
The results show that our proposed method can effectively learn the spatial heterogeneity of nodules and solve the problem of multiple view variability. Moreover, the consistency analysis indicates that our method can provide clinicians with more accurate results of benign-malignant lung nodule classification for auxiliary diagnosis, which is important for assisting clinicians in clinical diagnosis.
早期筛查对于提高肺癌患者的生存率和康复率至关重要。计算机辅助诊断系统(CAD)是协助临床医生进行早期诊断的有力工具。肺结节具有空间异质性特征。然而,许多尝试使用二维多视图(MV)框架来学习并简单整合多视图特征。这些方法存在无法有效捕捉空间特征以及忽略多视图变异性的问题。在本文中,我们提出了一种三维MV卷积神经网络(3D MVCNN)框架,并在其中嵌入挤压激励(SE)模块,以进一步解决MV框架中每个视图的变异性问题。
首先,通过空间采样方法提取肺结节的三维多视图样本,并建立一个三维卷积神经网络来提取三维抽象特征。其次,根据三维多视图样本和三维卷积神经网络构建一个三维MVCNN框架。该框架能够学习到肺结节不同视图的更多特征,同时考虑到肺结节的空间异质性特征。最后,为了进一步解决MV框架中每个视图的变异性问题,在特征融合阶段引入SE模块构建了一个三维MVSECNN模型。为了进行训练和测试,我们使用了公共LIDC-IDRI数据集的独立子集。
对于LIDC-IDRI数据集,本研究在二分类中达到了96.04%的准确率和98.59%的灵敏度,在三分类中达到了87.76%的准确率,高于其他现有研究。模型预测与病理诊断之间的一致性得分0.948显著高于临床医生注释与病理诊断之间的一致性得分。
结果表明,我们提出的方法能够有效地学习结节的空间异质性并解决多视图变异性问题。此外,一致性分析表明,我们的方法可以为临床医生提供更准确的肺结节良恶性分类结果用于辅助诊断,这对于协助临床医生进行临床诊断具有重要意义。