Tang Guozhi, Du Lingyan, Ling Shihai, Che Yue, Chen Xin
School of Automation and Information Engineering, Sichuan University of Science and Engineering, Yibin, China.
Artificial Intelligence Key Laboratory of Sichuan Province, Yibin, China.
Quant Imaging Med Surg. 2024 Dec 5;14(12):8942-8965. doi: 10.21037/qims-24-1315. Epub 2024 Nov 29.
The accurate classification of lung nodules is critical to achieving personalized lung cancer treatment and prognosis prediction. The treatment options for lung cancer and the prognosis of patients are closely related to the type of lung nodules, but there are many types of lung nodules, and the distinctions between certain types are subtle, making accurate classification based on traditional medical imaging technology and doctor experience challenging. This study adopts a novel approach, using computed tomography (CT) radiomics to analyze the quantitative features in CT images to reveal the characteristics of lung nodules, and then employs diversity-weighted ensemble learning to enhance the accuracy of classification by integrating the predictive results of multiple models.
We extracted lung nodules from the Lung Image Database Consortium image collection (LIDC-IDRI) dataset and derived radiomics features from the nodules. For the classification tasks of seven types of lung nodules, each was split into binary classifications. Two model-building methods were employed: M1 (equal-weighted voting ensemble classifier) and M2 (diversity-weighted voting ensemble classifier). Models were evaluated using 10-fold cross-validation with metrics including the area under the receiver operating characteristic curve (AUC), accuracy, specificity, and sensitivity.
Both methods effectively completed classification tasks. The M2 method outperformed M1, particularly in classifying texture, calcification, and the benign and malignant nature of lung nodules. The AUC values of the M2 method in the four subclassifications of texture types of lung nodules were 0.9913, 0.8838, 0.9525, and 0.8845, with the corresponding accuracies of 0.9651, 0.8116, 0.9000, and 0.8284, respectively. In the classification of the degree of calcification of lung nodules, the AUC value of the M2 method was 0.9775 with an accuracy of 0.9642. In the classification of the benign and malignant nature of lung nodules, the AUC value of the M2 method was 0.8953 with an accuracy of 0.8168. The combination of CT radiomics and diversity-weighted ensemble learning effectively identifies lung nodule types, providing a novel method for the precise classification of lung nodules and aiding personalized lung cancer treatment and prognosis prediction.
The combination of CT radiomics and ensemble learning for diversity weighting can be well realized to identify the type of lung nodules.
肺结节的准确分类对于实现肺癌的个性化治疗和预后预测至关重要。肺癌的治疗方案以及患者的预后与肺结节的类型密切相关,但肺结节类型众多,某些类型之间的区别细微,基于传统医学影像技术和医生经验进行准确分类具有挑战性。本研究采用一种新方法,利用计算机断层扫描(CT)影像组学分析CT图像中的定量特征以揭示肺结节的特征,然后采用多样性加权集成学习,通过整合多个模型的预测结果来提高分类的准确性。
我们从肺部影像数据库联盟图像集(LIDC-IDRI)数据集中提取肺结节,并从结节中提取影像组学特征。对于七种类型肺结节的分类任务,每种都拆分为二元分类。采用了两种模型构建方法:M1(等权重投票集成分类器)和M2(多样性加权投票集成分类器)。使用10折交叉验证对模型进行评估,评估指标包括受试者操作特征曲线下面积(AUC)、准确率、特异性和灵敏度。
两种方法均有效完成了分类任务。M2方法优于M1,尤其在肺结节的纹理、钙化以及良恶性分类方面。M2方法在肺结节纹理类型的四个子分类中的AUC值分别为0.9913、0.8838、0.9525和0.8845,相应的准确率分别为0.9651、0.8116、0.9000和0.8284。在肺结节钙化程度的分类中,M2方法的AUC值为0.9775,准确率为0.9642。在肺结节良恶性分类中,M2方法的AUC值为0.8953,准确率为0.8168。CT影像组学与多样性加权集成学习相结合能有效识别肺结节类型,为肺结节的精确分类提供了一种新方法,有助于肺癌的个性化治疗和预后预测。
CT影像组学与多样性加权的集成学习相结合能够很好地实现对肺结节类型的识别。